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Preface 


The Kepler conjecture asserts that the densest packing of three-dimensional Euclidean 
space by equal spheres is attained by the “cannonball” packing, or face-centered-cubic 
(FCC) packing, which fills space with density Far = 0.74048. This conjecture, 
formulated by Kepler in his booklet “Strena, seu, de Niue Sexangula,’ was published 
in 1611, exactly four hundred years ago. Notably, in 1900 Hilbert included the sphere 
packing problem in his famous problem list, as part of his 18th problem. More than a 
century later, ina landmark result, the Kepler conjecture was solved in work of Thomas 
C. Hales and Samuel P. Ferguson. An abridged version of their proof appeared in the 
Annals of Mathematics in 2005, followed a year later by the publication of a detailed 
proof. 

This book presents the Hales-Ferguson proof of the Kepler conjecture, together 
with supporting material and commentary. It begins with an introductory overview 
chapter, followed by a chapter on the local density approach to sphere packing bounds. 
This is followed by the six papers of Hales and Ferguson giving their detailed proof, 
as published in 2006 in a special issue of Discrete & Computational Geometry. Next 
comes a 2010 paper by Hales (with five other authors) making a slight revision to the 
2006 proof, and listing corrections. It concludes with two of Hales’s initial papers on 
the problem, published in 1997. All chapters except for the first are papers reprinted 
from Discrete & Computational Geometry. 

This book is divided into four parts, as follows. 


Part I: Introduction and Survey 
The editor has written the two chapters in this part. The first chapter is introductory 
and features a brief summary of the history of work on the problem, together with a 
description of Hales and Ferguson’s 1998 preprints, details of the peer review process 
of the Hales and Ferguson papers (which took eight years), and subsequent devel- 
opments. It also includes remarks on the reliability of the proof and the subsequent 
approach of Hales to obtain a formal proof of the Kepler Conjecture (in a formal 
logical system), entirely checkable by computer. 

The second chapter describes the general program of obtaining sphere packing 
upper bounds using local density inequalities, an approach that can be applied to 
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sphere packing in any dimension. It was published as a paper in 2003 in Discrete 
& Computational Geometry. After considering the general case, it specializes to 
the three-dimensional case, and desecribes the main features of the Hales-Ferguson 
local density inequality as presented in their 1998 preprints; the 2006 published proof 
established a slightly modified inequality. 


Part II: Proof of the Kepler Conjecture 
These six chapters comprise the heart of the volume. They reprint the six papers of 
Hales with Ferguson that together provide the detailed proof of the Kepler Conjec- 
ture. As noted earlier, these papers appeared as a special issue (2006) of Discrete & 
Computational Geometry. However, in this book the short index to definitions that 
appeared in this special issue is omitted; it is replaced by the two indexes at the end 
of the book. 

In 2010 Hales published an addendum and a list of errata to the published proof; we 
have added asterisks in the margin of the reprinted papers locating these corrections, 
which are listed in Part III of the volume on pp. 361-374. 


Part III: A Revision to the Proof of the Kepler Conjecture 

This part presents an important follow-up paper of Hales (with five coauthors) that 
was published in Discrete & Computational Geometry in 2010. It explains Hales’s 
program to obtain a formal proof of the Kepler Conjecture. The initial process of 
formalization uncovered one logical gap in the original proof, and this follow up 
paper provides a correction filling that gap. It also supplies a list of errata to the 
original papers. 


Part IV: Initial Papers of the Hales Program 

This part presents two early papers of Hales on Kepler’s conjecture, which were 
published in Discrete & Computational Geometry in 1997. These papers give his 
original formulation of an approach to proving the Kepler Conjecture via a local 
density inequality, and carry out some initial steps of this approach. They explain and 
establish a basic framework followed in the subsequent proof. As it turned out, to 
obtain a proof, this approach required some modification. This modification included 
changes to the local density inequalities as described in Part IN. The proofs given in 
Parts II and II are independent of these two papers. 


In reading this volume, it should be helpful to start with the introductory Chapter 1. 
Next one might study Chapter 2, which describes the general framework for obtaining 
upper bounds on sphere packing density, in any dimension. At present it is not known 
in which dimensions optimal such inequalities (i.e., tight inequalities) may exist: 
they are known to exist in dimensions 1, 2, and now, by the Hales-Ferguson proof, 
in dimension 3. It seems likely that optimal inequalities might exist in dimensions 
8 and 24 as well. One might then read the historical survey of Hales in Chapter 3, 
which also describes some features of the proof given in the next five chapters. (This 
chapter could alternatively be read before reading Chapter 2.) Next one could look 
at some of the details of the formulation of the proof (Chapter 4). It would also be 
useful to look at the Revision paper (Chapter 9) to see features of the ongoing work 
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towards a formal proof of the Kepler conjecture. Finally the reader may consider the 
remaining chapters in the volume. 

I thank many people who helped me with this editing project. Ricky Pollack 
gave useful general advice. Tom Hales made many useful comments regarding the 
introductory Chapter 1. JoAnn Sears (University of Michigan Library) helped obtain 
a phographic copy of Kepler’s 1611 volume from Cornell University Library. Finally 
I thank Ann Kostant and John Spiegelman for work on the preparation of this volume. 
During the preparation of this work I received support from NSF grants DMS-0500555 
and DMS-0801029. 


Jeffrey C. Lagarias 


Ann Arbor, MI 
January 11, 2011 
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Part I 


Introduction and Survey 


Introduction to Part I 


Part I contains two introductory chapters. 


The first chapter in an introductory paper written for this volume. It gives a sum- 
mary of the history of the problem, starting with the work of Kepler. It describes the 
difficulty of the problem, gives a description of Hales and Ferguson’s 1998 preprints, 
including details of the peer review process of the Hales and Ferguson papers (which 
took 8 years), and subsequent developments. It also includes remarks on the reliabil- 
ity of the proof and the subsequent approach of Hales to obtain a formal proof of the 
Kepler Conjecture (in a formal logical system), entirely checkable by computer. 


The second chapter provides a technical survey of local density approaches to 
obtain upper bounds on sphere packing density in any dimension. It then views the 
Hales-Ferguson approach and other approaches to the Kepler Conjecture within this 
context. It also provides an overview of the Hales-Ferguson proof, as given in the six 
1998 preprints. This paper appeared in 2002, during the 1998—2006 peer reviewing 
process of the Kepler Conjecture proof, and affected the final revised form of the 
proof. 
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The Kepler Conjecture and Its Proof, by J. C. Lagarias 


This introductory chapter gives a brief history of the Kepler problem and indicates 
sources of its difficulty. It describes various approaches to solving the problem, based 
on local density inequalities. It then discusses the proof of Hales and Ferguson and 
the peer-reviewing of their papers. It also discusses further developments, towards 
obtaining a formal proof of the Kepler conjecture, and other applications of the proof 
methodology. 
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The Kepler Conjecture and Its Proof 


Jeffrey C. Lagarias 


Abstract This paper describes work on the Kepler conjecture starting from its statement 
in 1611 and culminating in the proof of Hales-Ferguson in 1998-2006. It discusses both 
the difficulty of the problem and of its solution. 


1 The Kepler Problem for Sphere Packing 


The Kepler conjecture asserts that a densest packing of three-dimensional Euclidean 
space by equal spheres is given by the “cannonball” packing, or face-centered-cubic 
(FCC) packing, which fills space with density aa = 0.74048. This conjecture was 
formulated by Kepler in 1611 [43, pp. 14-17], as follows. After discussing cubical 
packing in two-dimensional layers versus hexagonal packing in two-dimensional layers, 
Kepler says (see Figure 1): 


Now if you proceed to pack the solid bodies as tightly as possible, and set 
the files that are first arranged on the level on top of others, layer on layer, the 
pellets will be either squared (A in diagram), or in triangles (B in diagram). If 
squared, either each single pellet of the upper range will rest on a single pellet 
of the lower, or, on the other hand, each single pellet of the upper range will 
settle between every four of the lower. In the former mode any pellet is touched 
by four neighbors in the same plane, and by one above and one below, and so 
on throughout, each touched by six others. The arrangement will be cubic, and 
the pellets, when subjected to pressure, will become cubes. But this will not be 
the tightest pack. In the second mode, not only is every pellet touched by its 
four neighbors in the same plane, but also by four in the plane above and four 
below, so throughout one will be touched by twelve, and under pressure spherical 
pellets will become rhomboid. This arrangement will be more comparable to the 
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‘Tam fiad ftructuram [olidorum quam pote/t freri arciiffimam pro- 
gredisris, «rdinesg, ordinibus fuperponas ,in plano priuscoaptatos, 
aut yerunt quadrati a aut trigontch: 
nfiquadratt aut finguliglobi ordinis 
[uperioris fingulis fuperftabuat ordt- 
nis inferioris aut contra finguli ordt- 
nis fuperioris [edebunt inter quater- 
nos ordinis infertoris, Priori modo 
tangitur gutlibet gloiws a guatuor 
circumflantibus in eodem plano , ab 

uno fup? afe,ey abvnoinfra fe: cy icin uninerfum a fex alys , critd, 
ordocubicus, G compreffionc facta fient cubi :fed nonerit arétiffima 
ce-*tatio. Posterierimodo praterquam quod quilibet globus aqua- 
‘tuor circumflantibusin eocem planotangitur, etiam &gquatuor infra 
Se 4quatuor fupra fe. Cr ficin uniuer(um a duodecim tangetur; 
Sientg, compreffione ex globofis Rhombica. Ordo hic magis affimila- 
bitur octaedro cr Pyramidi, Coaptatio fret arttiffima: vt nulla pra- 
terea ordine plures globuliinidem vas comping! queant. Rurfum fi 


Fig. 1 Kepler’s 1611 assertion. [Image courtesy of the Division of Rare and Manuscript Collections, Cornell 
University Libraries. ] 


octahedron and pyramid. The packing will be the tightest possible, so that in no 
other arrangement could more pellets be stuffed into the same container.! 


Kepler goes on to describe the FCC packing in more detail, saying (see Figure 2): 


Thus, let B be a group of three balls; set one A, on it as apex; let there 
be also another group, C, of six balls; another, D, of ten; and another, E, of 
fifteen. Regularly superimpose the narrower on the wider to produce the shape 
of a pyramid. Now, although in this construction each one in the upper layer is 
seated between three in the lower, yet if you turn the figure round so that not 
the apex but a whole side of the pyramid is uppermost, you will find, whenever 
you peel off one ball from the top, four lying below it in square pattern. Again 
as before, one ball will be touched by twelve others, to wit, by six neighbors in 
the same plane, and by three above and three below. Thus in the closest pack in 
three dimensions, the triangular pattern cannot exist without the square, and vice 


! “Tam si ad structuram solidorum quam potest fieri arctissimam progredaris, ordinesque ordinibus super- 
ponas, in plano prius coaptatos aut ii erunt quadrati A aut trigonici B: si quadrati aut singuli globi ordinis 
superioris singulis superstabunt ordinis inferioris aut contra singuli ordinis superioris sedebunt inter quaternos 
ordinis inferioris. Priori modo tangitur quilibit globis a quattuour cirucmstantibus in eodem plano, ab uno supra 
se, et ab uno infra se: et sic in universum a six aliis, eritque ordo cubicus, et compressione facta fient cubi: sed 
non erit arctissima coaptatio. Posteriori modo praeterquam quod quilibet globus a quattuor circumstantibus in 
eodem plano tangitur etiam a quattuor infra se, et a quattuor supra se, et sic in universum a duodecim tangetur; 
fientque compressione ex globosis rhombica. Ordo hic magis assimilabitur octahedro et pyramidi. Coaptatio 
fiet arctissima, ut nullo praetera ordine plures globuli in idem vas compingi queant.” [English translation: 
Colin Hardie [43, p. 15] 
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qvadrilateris.Efto enins 8 copulstrinm globorum. Ei faperpone sv- 
nun pro aprce,efto Cr alia copuls fentt sloboram Calis demu dcx 
pS ahiac tindenum® Inz0ne femper angufti- 
orem latiori , ut fist jizura Pyramidis. 
Eft igitur per hanc impufitione:a finguli 
fipertoresfederunt inter trinos wsferiores: 
tamen iam vexfafigura, vt non apex fed 
integrum latus pyramids fit loco fupe- 
riort ,quoties usum globulum degluberis 
cfummis , ixfrs fabunt guatuor ordine 
uadrata. Et rurfiim tangetur vnus glo- 
ws Ut prisms, aduodecimalys ,afexnem- 
pecircumstantibus in eadem plano tribus 
Supra Cy tribus infra. Ita 1m folida coapta- 
tione aré#ifima non potest effe ordo trian- 
SHlaris fine quadrangulari , nec virifiim, 
t Patetigitur, acinos?Punici malt, material 
necefiitate concurrente cum ratwnibus incremetiasinorum, exprimi 
infigurara Rhobici corporis: cum non injeftts frontibuus pertinaciter 
nitantur rotundiex aduer(s acini,fedcedant expuifi, in {paciainter 
ternos vel guaternos oppofitostntertecta. 


Fig. 2 Kepler’s description of the FCC “cannonball” packing. [Image courtesy of the Division of Rare and 
Manuscript Collections, Cornell University Libraries.] 


versa. It is therefore obvious that the loculi of the pomegranate are squeezed into 
the shape of a solid rhomboid.” 


The FCC packing had been noted earlier by the English mathematician Thomas Hariot 
[Harriot] (1560-1621). Hariot was mathematics tutor to Sir Walter Raleigh, designed 
some of his ships, wrote a treatise on navigation, and went on an expedition to Virginia 
in 1585-1587. He computed a chart in 1591 on how to most efficiently stack cannonballs 
using the FCC packing, and computed a table of the number of cannonballs in such stacks 
(cf. Shirley [55, pp. 242—243]). Hariot supported the atomic theory of matter, in which 
case macroscopic objects may be packed arrangements of very tiny spherical objects, 
i.e., atoms [42, Chap. II]. He corresponded with Kepler in 1606-1608 on optics, and 
mentioned the atomic theory in a Dec. 1606 letter as a possible way of explaining why 
some light is reflected, and some refracted, when hitting a liquid. Kepler replied in 1607, 


2 “Esto enim B copula trium globorum. Ei superpone A unum pro apice; esto et alia copula senum globorum 
C, et alia denum D et alia quindenum £. Impone semper angustiorem latiori, ut fiat figura pyramidis. Etsi 
igitur per hanc impositionem singuli superiores sederunt into trinos inferiores: tamen iam versa figura, ut 
non apex sed integrum latus pyramidis sit loc superiori, quoties unum globulum deglberis e summis, infra 
stabunt quattuor ordine quadrato. Et rursum tangetur unus globus ut prius, et duodecim aliis, a sex nempe 
circumstantibus in eodem plano tribus supra et tribus infra. Ita in solida coaptatione arctissima non potest 
ess ordo triangularis sine quadrangulari, nec vicissim. Patet igitur, acinos punici mali, materiali necessitate 
concurrente cum rationaibus incrementi acinorum, exprimi in figuri rhombici corporis” [English translation 
by Colin Hardie [43, p. 17]] 
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not supporting the atomic theory. The known correspondence of Hariot with Kepler does 
not deal directly with sphere packing. 


Questions on sphere packing attracted the attention of Isaac Newton. A discussion 
with the mathematician David Gregory on 4 May 1694 concerning the brightest stars 
was summarized in a memorandum of Gregory [50, Vol II], p. 317] as: 


To discover how many stars there are of a given magnitude, he [Newton] 
considers how many spheres, nearest, second from them, third etc. surround a 
sphere in a space of three dimensions, there will be 13 of first magnitude, 4 x 13 
of second, 9 x 4 x 13 of third? 


Newton’s own star table “A Table of ye fixed Starrs for ye yeare 1671” records 13 first 
magnitude stars, 43 of the second magnitude, 174 of third magnitude; cf. [50, Vol. II, 
p. 394]. In an (unpublished) notebook Gregory considered the packing problem in 2- 
dimensions and 3-dimensions and recorded that 13 spheres might touch a given equal 
sphere [50, Vol. III, p. 321]. It has been asserted that Newton believed only 12 spheres of 
fixed radius could touch a single sphere of the same radius, but I do not know of a primary 
reference for this assertion. Certainly every sphere in the FCC packing touches exactly 
12 neighbors. It is now known that the maximum number of disjoint equal spheres that 
can touch a given equal spheres (the “kissing number’’) is 12, as observed by Hoppe [36] 
in 1874 and shown rigorously in 1954 by Schiitte and van der Waerden [54]. See Conway 
and Sloane [10, Sec 1.2] and Casselman [7] for further discussion and references. 


In the 19th century it was discovered that besides the FCC packing there exists 
another equally dense sphere packing possessing a full translation group of symmetries, 
the hexagonal close packing (HCP). This packing was described by William Barlow [3] 
in 1883, in connection with the possible internal symmetries of crystals. 


In 1900 Hilbert [34] included the sphere packing problem in his famous problem list, 
as part of his 18th problem. He wrote: 


I point out the following question, related to the preceding one, and important 
to number theory and perhaps sometimes useful in physics and chemistry: How 
can one arrange most densely in space an infinite number of equal solids of given 
form, e.g., spheres with given radii or regular tetrahedra with given edges (or in 
prescribed positions), that is, how can one so fit them together that the ratio of 
the filled to the unfilled space may be as great as possible?+ 


See Milnor [48] for a further discussion and history of work on the various parts of 
Hilbert’s 18th problem. 

The topic of sphere packing is of interest in much wider situations in mathematics, 
physics and materials science; in the higher dimensional case it is also important in 


3 “Ut noscatur quot sunt stellae magnitudinis 1 ae, 2 dae, 3 ae & tc. considerando quot spherae proximae, 


seundae ab his 3 ae &tc spheram in spatio trium dimensionis circumstent: erunt 13 primae, 4 x 13 2-dae, 
9x 4x 133 ae.” 

4 “Ich weise auf die heirmit in Zusammenhang stehende, fiir die Zahlentheorie wichtige und vielleicht 
auch der Physik und Chemie einmal Nutzen bringende Frage hin, wie man undendlich viele K6rper von das 
gleichen vorgeschriebenen Gestalt, etwa Kugeln mit gegebenem Radius oder regulire Tetraeter mit gegebener 
Kante (bez. in vorgeschriebener Stellung) im Raume am dictesten einbetten, d. h. so lagern kann, da des 
Verhiltnis des erfiilten Raumes zum nichterfiillten Raume moglichst gro ausfallt.” [English translation: Dr. 
Mary Winston Newson] 
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communications and coding theory. For a physics viewpoint see Aste and Weaire [2] 
and for both mathematical and communications aspects see Conway and Sloane [10]. 
For purely mathematical aspects, see Boréczky [5], Fejes-Toth [15], Rogers [53], and 
Zong [61]. 

This volume includes the complete proof of the Kepler conjecture by Thomas C. 
Hales with Samuel P. Ferguson, first presented in six preprints posted on the math arXiv 
in 1998, and then after extensive revisions, published in six papers in 2006 in Discrete & 
Computational Geometry. (An abridged version of this proof was published earlier by 
Hales in Annals of Mathematics in 2005.) The proof consists of a large body of math- 
ematical arguments and a massive computer verification of many inequalities. There is 
also a follow-up paper written in 2009 (“A revision of the proof of the Kepler Conjec- 
ture’) and published in 2010, supplying more details about one point in the 2006 proof, 
and describing progress towards constructing a formal proof of the Kepler conjecture, in 
a formal language, completely checkable and certifiable by computer. When completed, 
this will be a second-generation proof. 


We view these papers not just as mathematics but also as historical documents. 
Below we describe some aspects of the history of work on the problem, and how the 
work of Hales and Ferguson came about. Then we describe the peer-review process, the 
reliability of this proof, and the ongoing work of Hales to obtain a formal proof of the 
Kepler conjecture. 


As background, we first explain why the problem is hard. 


2 Why the Kepler Problem Is Difficult 


It is not immediately apparent that Kepler’s Conjecture is a problem of extraordinary 
difficulty. 


A first difficulty is to define a rigorous notion of density of packing of spheres. This is 
resolved using limiting notions of packing a large region and letting its diameter increase 
to infinity. This notion was not clarified until the 20th century. One can then prove a result 
asserting that a packing exists that attains a maximal limiting density; cf. [53, Chap. 1]. 


A second difficulty is that notions of limiting density are very crude in the sense 
that one can remove spheres from an arbitrarily large finite region without affecting 
the limiting density. One may therefore wish to impose further local restrictions on the 
notion of a “densest” packing, for example; that it contain no large “holes.” We define 
a saturated packing of spheres to be one into which no new sphere can be inserted, i.e., 
there is no “hole” of diameter 2 or larger in the packing. Besides this, one might be 
able to increase density locally by removing a finite collection of spheres in a region 
and repacking that region to squeeze in one more sphere. This sort of condition seems 
difficult to analyze, but it already shows that one may wish to take account of “local” 
conditions specifying density of a packing, compare Bezdek et al [4]. 


A third difficulty, peculiar to three dimensions, is that there exist uncountably many 
essentially different “optimally dense" packings. Here we consider packings as essen- 
tially different if they are not congruent under a Euclidean motion of space. Consider 
the packing that starts with a planar layer of hexagonally close packed spheres. That is, 
there is a planar slice through this layer that intersects all the sphere centers, giving a 
circle packing of the plane, and this packing is the optimal two-dimensional hexagonal 


9 


10 Jeffrey C. Lagarias 


(a) Packing layer A (b) First layer A, second layer B 


Fig. 3 Hexagonal packing layers viewed from above 


circle packing. Now one can fit a second identical layer of spheres on top of this layer, 
so that the spheres fit as deep as possible in indentations between the spheres in the first 
layer. If we mark one sphere as the center of the first layer, there are two possible ways to 
do this: each such packing occupies 3 of the 6 holes formed by a hexagon and there are 
two choices. One can repeat this in the layer below the first layer, and continue adding 
layers in this fashion, making a choice at each layer. All such packings have the same 
limiting density. If the stacked layers are viewed vertically from above, then spheres in 
all the layers can be seen to line up in three possible positions of the hexagonal lattice, 
which we can label A, B, C, with the packing layer with the marked sphere labelled A. 
These are pictured in Figure 3. Note that the centers in B and C layers are “deep holes” 
in layer A. 


In effect the choices at each layer of the packing can be labelled using letters 
{A, B, C}, with no two consecutive letters the same, to uniquely label a packing (with a 
single marked sphere serving as origin) with a doubly infinite string of such letters. It can 
be shown that two packings are essentially the same (up to a Euclidean motion) if their 
doubly infinite strings of letters can be lined up to agree. Two of these packings are es- 
pecially nice, the face-centered-cubic lattice packing (FCC packing) and the hexagonal 
close packing (HCP packing), described further below. The FCC packing corresponds to 
a repeating pattern ABCABCABC. . . while the HCP packing corresponds to a repeating 
pattern ABABAB. .. (or ACACAC. .. or BCBCBC. . .). The FCC packing was found by 
Kepler, while the HCP packing was first described by Barlow [3] in 1883; cf. Coxeter 
[11, Sec. 22.4]. These packings are described at the beginning of Hales [24] (in this 
volume). 


The collection of “optimally dense” packings just described are locally optimal in a 
very strong sense. A Voronoi domain or Voronoi cell around a given sphere center is the 
set> of all points in space closer to that sphere center than to any other sphere center. In 
saturated packings all Voronoi cells are polyhedrons. For the packings just described, all 


5 This defines the interior of the Voronoi domain. The Voronoi domain itself is the closure of this set, adding 
boundary points which are certain points equidistant to two sphere centers. 
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(a) Rhombic Dodecahedron (b) Trapezoidal Dodecahe- 
dron 


Fig. 4 Voronoi Domains of (a) Type 1 (FCC) (b) Type 2 (HCP) 


Voronoi domains consist of one of two shapes, each having 12 faces and 14 vertices [15, 
p. 173]. (See Figure 4.) 
These are: 

Type 1. Rhombic Dodecahedron (all 12 faces are rhombi). 

Type 2. Trapezoidal Dodecahedron (6 faces trapezoids, 6 faces rhombi) 


Here Type 1 occurs for all Voronoi cells in the middle layer of consecutive layers 
labelled ABC, ACB, BAC, BCA, CAB, CBA, while type 2 occurs for all Voronoi cells 
in consecutive layers labelled ABA, ACA, BAB, BCB, CAC, CBC. Voronoi cells of 
types | and 2 are known to have the same volume and surface area. For each of these 
cells, the ratio of the volume of each sphere to the volume of the Voronoi cell containing 
it is exactly ve Thus in these packings optimality holds locally; that is, it is attained 


simultaneously in each Voronoi cell. In the FCC (Face-Centered Cubic) lattice packing, 
all Voronoi cells are of type 1. In the HCP (Hexagonal Close Packing) packing, all Voronoi 
cells are of type 2. (See [58].) In all remaining packings above, Voronoi cells of both 
types occur in the packing. The packings can be told apart by their different arrangements 
of types | and 2 Voronoi cells. All these packings have at least a two-dimensional lattice 
of translational symmetries; the FCC and HCP packings, and countably many others, 
have a full three-dimensional lattice of translational symmetries. From the mathematical 
viewpoint, as an optimization problem, this means that there are (at least) two different 
local optima, represented by the two types of Voronoi cells above. 

A fourth difficulty is that the optimization problem to maximize density is a priori 
an infinite-dimensional problem: one has infinitely many spheres to pack. Describing 
a packing requires infinitely many variables: three variables each for the coordinates 
of each sphere center. Approaches to make headway with this problem seek to prove 
stronger results which only involve finite-dimensional optimizations that encode local 
conditions. This is the “local density inequality” approach described below and in the 
next section. These approaches in effect assign, by some recipe, to each sphere in a sphere 
packing a (weighted) sum of the covered and uncovered volume near that sphere center. 
This recipe is “local” in that the weighted sum for a given sphere center is completely 
determined by the locations of all spheres in the sphere packing nearby, within a fixed 
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distance C of the given sphere center. When the recipe quantities are added up over all 
spheres, it should count all volume with weight 1. If so, then an upper bound on the 
weighted area will give an upper bound on global sphere packing density. We say such 
a local density inequality is “optimal” if it will produce the upper bound —¥ for the 


V18 
sphere packing density. 

A fifth difficulty is that it is not clear that “optimal” local density inequalities exist, 
and if they do exist they (apparently) cannot be very simple. In support of this view, the 
two most natural ways to locally partition space attached to a sphere packing yield local 
density inequalities that are not “optimal,” as we now explain. 

The first natural way to partition space corresponding to sphere centers is to divide it 
into Voronoi cells around these points, as described above. The region assigned to each 
sphere center is its Voronoi cell. However, it is known that an arrangement of 12 unit 
spheres touching a given unit sphere with their sphere centers being the vertices of a 
regular dodecahedron yields a Voronoi cell that is a regular dodecaheron of inradius 1, 
having a ratio of covered to uncovered volume approximately 0.754697, which exceeds 


Ton 
fia 0.74048. 


A second natural partition of space is the Delaunay tessellation, which is a partition 
of space R? into simplices (tetrahedra) having vertices among the sphere center points. 
We describe it further below. Such Delaunay simplices have their vertices at four sphere 
centers, so that their edge lengths are necessarily 2 or greater. Now we can cut up these 
tetrahedra, for example by barycentric subdivision, and assign parts of the uncovered area 
to each of the four spheres at the corners of the tetrahedron. Each sphere is now assigned 
certain regions associated to each of the Delaunay simplices for which its center is one 
vertex. Hales notes ([24, p. 13] in this volume) that there is a Delaunay tessellation having 
an individual tetrahedron with volume of covered to uncovered volume of 0.78469. Even 
if one sums over all Delaunay tetrahedra associated to a given sphere center, there are 
examples of local configurations with density exceeding the Kepler bound. An original 
approach of Hales in the early 1990s, described in [17], [18], based on the Delaunay 
tessellation, ground to a halt due to combinatorial difficulty (cf. Hales [24, Sec. 2.1]). 

The failure of these two natural decompositions to be “optimal” indicate the necessity 
to consider more complicated inequalities in which the region assigned to a sphere center 
will need in certain cases to “borrow” an excess of uncovered volume from some spheres 
nearby to this sphere. This raises the spectre that perhaps the distance over which volume 
has to be “borrowed” is in fact unbounded. If so, no “optimal” local density inequality 
exists. One would have instead an infinite sequence of such local inequalities, taken over 
larger regions, each giving a better upper bound, tending in the limit to Tie 


A sixth difficulty, then, is that of designing candidate local density inequalities that 
may be “optimal.” If an “optimal” local inequality does exist, then it can in principle 
be verified by a finite computation, i.e., it comprises a finite dimensional non-linear 
optimization problem over a large number of variables, specifying the possible locations 
of sphere centers in a ball of radius C around a given sphere center. The size of the 
problem rapidly goes up as the distance from the initial sphere increases. There can be 
12 spheres touching a given sphere, and the next layer of spheres can contain more than 
30 spheres. There now arises a psychological difficulty, which is that the “optimality” 
of the local density inequality is only certified after the fact, when a proof is found. 
This means that one must first do a very large amount of work, with the downside risk 
of eventually determining that the inequality is not optimal. Therefore, one would like 
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to increase one’s confidence in advance that the inequality considered is likely to be 
“optimal,” by making various preliminary experimentation and computer checks, before 
attempting a full scale proof. 

There are now several reasonable candidates that have been put forward to be “opti- 
mal” local density inequalities. These include one proposed by Laszl6 Fejes-T6th in 1953 
[15] and again in 1964 [16], one proposed by W.-Y. Hsiang [37] in 1993, and several dif- 
ferent such inequalites proposed by T. C. Hales, both alone and with S. P. Ferguson. The 
original local density inequality of Fejes-Toth considers weighted averages of Voronoi 
cells of nearby sphere centers, and that of Hsiang is similar in spirit. In 1991 Hales [17], 
[18] gave an approach proposing “optimal” local inequalities based on the Delaunay 
tessellation, with some Voronoi cell correction terms. In 1994 and later, Hales [20], [21], 
and Hales and Ferguson [25] formulated various candidate “optimal” inequalities using a 
hybrid decomposition of space employing elements of both the Delaunay tessellation and 
the Voronoi domain tessellation. The analysis of Hales and Ferguson suggests that there 
is a large class of such inequalities that will be “optimal,” and there is some flexibility 
in formulating them. These inequalities will however have quite complicated “scoring” 
functions. 

A seventh, and most crucial difficulty for “optimal” local density inequalities is the 
computational size of the problem. Each separate local density inequality asserts an upper 
bound to an enormously complicated finite-dimensional nonlinear optimization problem. 
This problem may simply be too large to be computationally feasible. It involves finding 
the global maximum of a highly non-linear function, over a high dimensional space of 
possible local configurations, enumerated by the location of the centers of all the nearby 
spheres. Further, this seems to involve on the order of 40 spheres, so that the problem has 
approximately 120 dimensions. General nonlinear problems of this dimensionality, with 
no additional structure, are too large to solve at present. In addition, the partitioning of 
space typically results in many combinatorially distinct configurations, so that the space 
of all configurations has a huge number of connected components. The complexity of 
the problem will also depend on the landscape of local optima, i.e., how many there are 
and how close they are to the global optimum. Therefore the local density inequality 
to prove needs to be carefully engineered to have extra properties that will simplify the 
computations. One of the key features of the Hales and Ferguson approach is to design 
such a “local density” inequality, allowing it to be complicated and inelegant, in order to 
make it have properties that reduce the needed computations. Their “Formulation” paper 
fills 49 pages to describe the partition of space and “scoring” function comprising the 
local density measure (see [25] in this volume). The final proof is an intricate blend of 
theory, in part needed to simplify the problem, together with large-scale computations. 

In the Hales-Ferguson proof, in this volume, the local density inequality is designed so 
that the regions to be analyzed split up into (non-interacting) cones whose contributions 
to the “score” can be evaluated separately and then added. It is also necessary for the 
inequality to be proved to have a finite number of (sets of) global maxima. Then the proof 
will logically split into two parts: an exact analytic treatment in the neighborhood of each 
global maximum, proving this property, and then a cruder analysis of the remaining (very 
large) part of the space of possible configurations, establishing inequalities showing a 
bound strictly below the optimum. The second part is the vast bulk of the proof. In the 
actual proof there are two types of global maxima, corresponding to the two types of 
Voronoi domains above. (In the proof these comprise a finite number of different local 
configurations, called decomposition stars, that attain the maximum.) One might think 
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that the difficulty in the proof is in checking near the global maxima, but in fact the greatest 
part of the difficulty is in dealing with the inequalities on the huge remaining portion 
of the configuration space. In fact, it is extremely hard to describe this configuration 
space in a way that is suitable for analysis and computation. The problem is reduced to 
many smaller nonlinear optimization problems, which are relaxed to convex optimization 
problems having a single maximum, that can be bounded above. The devil is in the details. 


3 Local Density Approach: History 


The first person to formulate a local density approach was Laszl6é Fejes-T6th [15], who 
made many deep insights into the sphere packing problem. In 1953 [15, pp. 174-181] 
he indicated the possibility of proving the Kepler conjecture by relating it to a nonlinear 
optimization problem over a compact set. More precisely, he proposed a specific inequal- 
ity that might hold. This inequality involved averages of Voronoi cell volumes of close 
neighbors, and would imply Kepler’s conjecture. In 1964 he restated this inequality [16, 
pp. 295-299] and added: 


Thus it seems that the problem can be reduced to the determination of a 
minimum of a function of a finite number of variables, providing a programme 
realizable in principle. In view of the intricacy of this function we are far from 
attempting to determine the exact minimum. But, mindful of the rapid develop- 
ment of our computers, it is imaginable that the minimum may be approximated 
with great exactitude. 


Such optimization problems can generically be termed “local density inequalities” 
as they involve some weighted measure of density associated to a neighborhood of each 
sphere center in a packing. Establishing a local density inequality yields a result which 
is stronger than the Kepler conjecture, since for optimality it requires the existence of a 
packing maximizing the local inequality simultaneously at every sphere center. 

In particular, each local density inequality asserts a different mathematical result. 
This is so, although each may imply Kepler’s conjecture as a corollary. Assuming that 
the associated optimization problem has a finite set of isolated global maxima, and that the 
local minimality can be verified analytically in an open neighborhood of each of these 
maxima, then in principle the inequality can be proved by computer elsewhere using 
interval arithmetic. However the size of the resulting optimization problems seemed far 
beyond the range of what could be solved by a computer at that time, or even now, 
without new ideas to reduce the complexity of the problem. 

A precise framework for local density inequalities is given in the next paper in this 
volume [45]. Local density is measured at a sphere center by a “score function” which 
assigns to it the volume nearby counted with certain penalties and credits. These penalties 
and credits must have the property that when summed over all sphere centers they cancel 
out, so that volume is then counted with weight at most | everywhere. 

The two major starting points in designing such local density “score functions” are the 
Voronoi tessellation (also called the Dirichlet tessellation, and, in physics, Wigner-Seitz 
cells) and Delaunay tessellation. Given centers of spheres in a saturated sphere packing, 
the Voronoi tessellation partitions space into Voronoi cells, which for each sphere center 
is the (closure of) the set of points in space closer to that sphere center than any other 
sphere center; this region contains the unit sphere. For saturated sphere packings with 
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unit radius spheres, a Voronoi cell is a finite polyhedron having no point at distance 
exceeding 2 from the sphere center. 

A Delaunay tessellation for a set of points in space R? is a subdivision of space 
into simplices (tetrahedra) such that the sphere circumscribing each tetrahedron has no 
points of the set in its interior. Delaunay [12] showed such tessellations exist. They 
are efficiently constructible; cf. Watson [59]. The resulting tessellation is topologically 
dual to the Voronoi tessellation, i.e., vertices of the Delaunay tessellation correspond 
to regions in the Voronoi tessellation, edges in the Delaunay tessellation correspond to 
faces of Voronoi cells, faces in the Delaunay tessellation correspond to edges in Voronoi 
cells, and the interior of a Delaunay cell corresponds to a vertex of a Voronoi cell lying 
in its interior. The Delaunay tessellation is unique for points in general position (no 5 
points at equal distance from one point) but may be non-unique otherwise. In a finite 
region there are only a finite number of choices for the Delaunay simplices. 

As noted above, the Voronoi and Delaunay partitions of space do not directly yield 
“optimal” local density inequalities. Further averagings of space are required. The ap- 
proaches to local density inequalities proposed by Fejes-Toth and by Hsiang [38], [41] 
are based on averaging of Voronoi cell densities of regions near a given sphere. Hales 
proposed a local density inequality based on Delaunay tessellation in 1991 [17, Sec. 4], 
which involves a functional defined on a compact space of (abstract) Delaunay stars. His 
original study indicated that this inequality is near optimal, and that to get an optimal 
inequality some correction terms had to be added; he proposed terms that add some 
information associated to Voronoi cells. He then formulated a candidate “optimal” in- 
equality based on a superposition of Delaunay and Voronoi tessellations [17]. In work 
after 1994 Hales [20], [21] (papers in part IV of this volume) formulated “hybrid” local 
inequalities which decompose space starting from a Delaunay-like decomposition but 
use a Voronoi-type decomposition on that part of space on which the Delaunay simplices 
do not have prescribed shapes. The local density inequality of Hales and Ferguson [25] 
uses a decomposition of space starting from a set of simplices with corners at four sphere 
centers, having allowed shapes, without imposing the “empty sphere” condition of the 
Delaunay tessellation. (They did this to make their local conditions not interact across 
long distances.) 

In 1990 Wu-Yi Hsiang proposed a local density inequality using averages of Voronoi 
cell volumes and announced a proof of it; this would prove the Kepler Conjecture. This 
announcement received much publicity, see Szpiro [56, pp. 144-152]. One idea of Hsiang 
was to use spherical geometry rather than Euclidean geometry to obtain the sphere pack- 
ing bounds: some of the uncovered regions could be treated as spherical triangles, with 
volumes computed using spherical trigonometry. A paper was subsequently submitted 
to Annals of Mathematics. The reviewers there uncovered some difficulties in the proof. 
In response to the reviewer’s criticisms, Hsiang did not resubmit his paper to the Annals 
of Mathematics, and instead submitted it elsewhere. He eventually was able to publish 
a revised 93 page paper in 1993 in the International Journal of Mathematics (Hsiang 
[38]). This journal records that the paper was received 17 November 1992, revised 9 
March 1993, and was in print by the end of 1993, a rapid review for a paper of its length 
and complexity. 

Objections were voiced to the published Hsiang proof by Conway, Hales, Muder and 
Sloane [9], and were detailed by Hales [19] in 1994 in the Mathematical Intelligencer. 
These objections applied not only to specific results claimed in the proof, but also asserted 
that some methods of argument used were invalid. To this critique Hsiang [40] gave a 
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rejoinder. Hsiang [41] later supplied in 2002 a book-length treatment of this proof, along 
with other topics. 

The consensus of the mathematical community, as represented in Mathematical Re- 
views in the reviews of Hsiang’s 1992 paper and 2002 book, is that the Hsiang proof is 
incomplete. The review of Hsiang’s 1993 paper (by G. Fejes-T6th) says: 


I think there is hope that Hsiang’s strategy will work: at least the main in- 
equalities seem to hold. As far as the details are concerned, my opinion is that 
many of the key statements have no acceptable proofs. Typically we are given 
arguments such as “the most critical case is...” followed by the statement that 
“the same method will imply the general case.” The problem with arguments of 
this kind is they require the reader to redo some pages of calculations but, notori- 
ously, that they occur at places where we expect difficulties and most frequently 
it is impossible to see how the same method works in the general case. 


In particular, the critique of this proof asserts that the large mass of possible local con- 
figurations of spheres has not been shown by Hsiang to have his local averaged Voronoi 
density strictly below the optimal bound. Specifically, a major objection is that an essen- 
tial part of Hsiang’s proof approach is flawed, that of using deformation arguments to 
simplify configurations on which the density is nondecreasing. There are notorious exam- 
ples where such monotonicity of density under deformation does not hold, as indicated 
in the objections of Hales [19] above. 

These issues are not considered resolved by Hsiang’s book. Concerning Hsiang’s 
book [41], the reviewer for Mathematical Reviews (U. Hertrich-Jeromin) states: 


The present reviewer is not in a position to judge whether the proof of Main 
Theorem | or of the Kepler conjecture presented in this book is correct, complete, 
or adequate. 


It may be that the Hsiang local density inequality is in fact an “optimal” inequality. 
If so, in principle this might be established by a computer-aided proof. To accomplish 
this there appear to remain extensive issues needing resolution to reduce the problem to 
computation, and it is not clear that the resulting computational problem would be of 
feasible size to attempt. 

In any event, the work of Hales, with Ferguson, described below, successfully estab- 
lishes a different local density inequality implying Kepler’s conjecture. 

It remains possible that in the future new local density inequalities will be discovered 
which will permit alternative proofs of the Kepler Conjecture which are simpler than the 
current proof. 


4 Hales Program and Hales-Ferguson Papers 


An approach to prove the Kepler conjecture was developed by T. Hales starting about 
1990 (see [17], [18]), based on constructing a local density inequality starting from the 
Delaunay tessellation. A significant problem lies in formulating a local density inequality 
of this sort that is: 

(1) expected to be “optimal,” and 

(2) which represents a non-linear optimization problem small enough to be solvable 
in a reasonable time. 
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One of Hales’s contributions was the formulation of a larger, more flexible class of local 
density measures than had been formulated previously; cf. [45, Sec. 2] (in this volume). 

As explained by Hales [24] (in this volume), he eventually found it necessary to mod- 
ify this pure Delaunay tessellation approach to Kepler’s conjecture. He then considered 
an approach that used hybrid inequalities that mixed Voronoi and Delaunay domains. 
A definite program to prove Kepler’s Conjecture based on such a decomposition was 
formulated by Hales in 1994, described in [20], [21] (papers in this volume). Addi- 
tional modifications of the local density inequality turned out to be required to obtain a 
tractable problem. In the (re)formulation process, important contributions were made by 
S. P. Ferguson, a doctoral student of Hales whose published version of his Ph.D. thesis 
work forms the paper Sphere Packing V. Significant innovations of Hales were needed 
to reduce the optimization problem to a tractable size. 

A first version of a complete proof was presented in six preprints, one joint with 
S. P. Ferguson, and one by Ferguson alone, and was posted on the mathematics arXiv in 
1998. The proofs then underwent intensive peer-reviewing and revision, culminating in 
the publication of the proof in 2005 and 2006. 

The initial 1998 papers were written while Hales was a faculty member at the Uni- 
versity of Michigan. At that time Ferguson was a graduate student at the University of 
Michigan, and he subsequently received his Michigan Ph.D. for work on the Kepler 
problem. Some of the revisions on the paper were done after Hales became a faculty 
member at the University of Pittsburgh. 


5 Peer-Reviewing of the Hales-Ferguson Papers 


A week-long workshop on the Hales-Ferguson proof was given at the Institute for Ad- 
vanced Study in January 1999. At that time this editor wrote a summary of the top level 
of the proof tree, which was later included in [45] (paper in this volume). Also in 1999 
Joseph Oesterlé gave a Séminaire Bourbaki exposé on the Hales-Ferguson proof [51]. 

The six papers were submitted to the Annals of Mathematics, to Robert McPherson 
as editor, and a team of at least 13 reviewers was assembled by Gabor Fejes-Toth. It is 
very unusual to have such a large set of reviewers. The main portion of the reviewing 
took place in a seminar at Edtvos University, Budapest, over a three year period. Some 
reviewers made computer experiments, in a detailed check of specific parts of the proof. 
The nature of this proof, consisting in part of a large number of inequalities having little 
internal structure, and a complicated proof tree, makes it hard for humans to check every 
step reliably. In this process detailed checking of many specific assertions found them 
to be essentially correct in every case. This result of the reviewing process produced in 
these reviewers a strong degree of conviction of the essential correctness of this proof 
approach, and that the reduction method led to nonlinear programming problems of 
tractable size. 

The popular book of George Szpiro [56], and a book review by Frank Morgan [49], 
give a status report on how the peer-review process of the proof was viewed as of 2003. 

This editor became involved in the reviewing process in January 2003. At that time 
Hales undertook a substantial revision and reorganization of the papers, making clearer 
the top portion of the proof tree. This revison also led to a simplification of some defi- 
nitions and lemmas. Ferguson’s paper was also revised. The six final published papers 
include added remarks motivating the choice of the given local density inequality, which 
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is very complicated. They also provide some reasons indicating why one should expect 
this particular local density inequality to hold. The revised version of the Ferguson paper 
“Sphere Packing V” now includes an explicit error bound away from the optimal density. 

An abridged version of the proof, authored by T. Hales, was published in Annals 
of Mathematics, 162 (2005), 1065-1185. This paper presents the upper level details of 
the proof tree but excludes the paper of S. Ferguson handling one important case. This 
paper, presenting the upper level details of the proof tree but excluding the paper of 
S. P. Ferguson handling one important case, is supplemented with computer files and 
programs which are available on the Annals of Mathematics website. 

In response to the issues raised by the reviewing of the Hales and Ferguson papers, 
the Annals of Mathematics added, at the front of the issue of the journal in which the 
Hales paper appears, the following “Statement by the Editors” [1]. 


Computer-assisted proofs of exceptionally important mathematical theorems 
will be considered by the Annals. 

The human part of the proof, which reduces the original mathematical prob- 
lem to one tractable by a computer, will be refereed for correctness in the tradi- 
tional manner. The computer part may not be checked line-by-line, but will be 
examined for the methods by which the authors have eliminated or minimized 
possible sources of error: (e.g., round-off error eliminated by interval arithmetic, 
programming error minimized by transparent surveyable code and consistency 
checks, computer error minimized by redundant calculations, etc. [Surveyable 
means that the interested person can readily check that the code is essentially 
operating as claimed]). 

We will print the human part of the paper in an issue of the Annals. The 
authors will provide the computer code, documentation necessary to understand 
it, and the computer output, all of which will be maintained on the Annals of 
Mathematics website online. 


This statement continues to reflect the views of the Editors of Annals of Mathematics, 
and currently appears on the journal’s website. 

In 2006 a detailed version of the proof was published in Discrete & Computational 
Geometry, Vol. 36, No. 1 (2006), 5-265. These papers appear in this book. They are 
revised and rearranged versions of the six 1998 preprints; the 2005 Annals paper repre- 
sented a subset of these papers. These papers appeared as a complete issue of Discrete & 
Computational Geometry, edited by Gabor Fejes-Téth and myself as Guest Editors. In 
an unusual feature, the Foreword to this issue lists by name (some of) the peer review- 
ers: Andras Bezdek, Michael Bleicher, Karoly B6réczky, Karoly Boréczky, Jr., Gabor 
Fejes-Toth, Aladar Heppes, Wlodek Kuperberg, Endre Makai, Attila Por, Giinter Rote, 
Istvan Talata, Bela Uhrin, and Zoltan Ujvary-Menyhard. This set of reviewers includes 
many well-known experts in discrete geometry. 


6 Reliability of the Hales-Ferguson Proof 


To what extent have Hales and Ferguson given a complete proof of the Kepler conjec- 
ture? The high level part of the proof tree is given as a standard mathematical proof. 
It has a definite overall strategy that subdivides the problem into manageable parts. It 
also supplies a rationale for the choice of local density inequality considered. At the 
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bottom level of the proof tree are a large collection of inequalities verified by computer 
calculation. In some cases, where mathematical programming is involved, the computer 
made decisions as to how to break the problem into smaller pieces to achieve a proof. 
There has been considerable controversy over the use of such proofs relying on large 
amounts of computation, of a non-routine sort, which are hard for humans to check. Is 
such a proof absolutely reliable? We have reached a borderline between mathematics 
and science, where the computations can be viewed as an experiment that is in principle 
reproducible by others. 

The topic of the nature and reliability of proofs involves both mathematical and philo- 
sophical issues. A recent discussion of this topic appears in the Philosophical Transac- 
tions of the Royal Society A [6] in October 2005, which mentions in particular the 
Hales-Ferguson computer-aided proof of the Kepler Conjecture. 

A Fields Medal winner, W. P. Thurston [57, Sec. 4], argues that standard (pre-formal) 
mathematical proofs are not absolutely reliable, but “there is a strong social standard of 
validity and truth.” He writes [57, p. 170]: 


Mathematics as we practice it is much more formally complete and precise 
than other sciences, but it is much less formally complete and precise for its 
content than computer programs. 


Supporting the view of human checkability above, Thurston writes [57, p. 170]: 


People are usually not very good in checking formal correctness of proofs, 
but they are quite good at detecting potential weaknesses or flaws in proofs. 


He adds, on revision of proofs: 


Mathematicians can and do fill in gaps, correct errors, and supply more detail 
and more careful scholarship when they are called on or motivated to do so. Our 
system is quite good at producing reliable theorems that can be backed up. It’s 
just that the reliability does not primarily come from mathematicians checking 
formal arguments; it come from mathematicians thinking carefully and critically 
about mathematical ideas. 


What is clear is that Hales and Ferguson have presented a proof schema for formulat- 
ing “local density inequalities” of a kind believed to be optimal inequalities. They have 
presented an effective methodology for formulating a class of “local density” inequalities 
of a type expected to contain many “optimal” inequalities. These involve partitioning 
Euclidean space associated into sphere centers, first into Delaunay-like simplices, pro- 
vided their shapes satisfy suitable restrictions, then partitioning the remaining space 
into Voronoi-like cells. They also assign weights to (subdivisions of) the Delaunay-like 
simplices, to define a “decomposition star” of quantities contributing to the score at a 
given sphere center. They also provided effective reduction methods for the nonlinear 
optimization problem, reducing it to tractable size for computation of the bottom levels 
of the proof tree. They have given very precise details of the proof tree to carry this out 
for a specific “optimal” local density inequalities. These details have been extensively 
peer-reviewed without significant error found. This holds whether or not one believes 
the software has been coded reliably, or the calculations entirely free of errors. Hales and 
Ferguson have presented details sufficient for reconstruction along these lines of proofs 
of a similar kind. 
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To consider the reliability question further, a proof published in a mathematical jour- 
nal is written in a high level language, not in a logical language. To quote the philosopher 
of mathematics Imre Lakatos [47] in the article “What does a mathematical proof prove”: 


Proofs can be divided into (1) pre-formal, (2) formal, and (3) post-formal 
proofs. Of these, (1) and (3) are informal proofs. 


The pre-formal proofs in a mathematical journal typically only form an outline of 
a schema of formal proof, which is intended to be convincing by supplying details 
concerning the key ideas in the proof, and precise details on the most difficult parts of 
the proof. A well-written pre-formal proof often satisfies the dictum [44, p. 315]: 


The proof tells us where to concentrate our doubts. 


A reader must have a certain mathematical background and training to read a pre-formal 
proof. Such a proof can be regarded as a schema intended to be convertible to a proof in 
a formal logical language. The latter would then constitute a formal proof. The resulting 
formal proof would not be easy to read by a human, but it could be reliably checked 
by a computer. In the realm of formal proofs, a digital computer is a more reliable 
proof-checker than a human. 

The proof of the Kepler Conjecture that Hales and Ferguson presented in this volume 
is a pre-formal proof in the Lakatos sense. It provided a strong measure of conviction in 
the judgment of the reviewers, and is reliable in the Thurston sense. 

However, here we may recall that Thurston also says [57, pp. 170-171]: 


When one considers how hard it is to write a computer program even ap- 
proaching the intellectual scope of a good paper, and how much greater time and 
effort have to be put in to make it “almost” formally correct, it is preposterous to 
claim that mathematics as we practice it is anywhere near formally correct. 


This observation motivates the pursuit of a formal proof of the Kepler conjecture. Con- 
structing such a proof is a current project of Hales, discussed below. 

We remark that the final category of proofs considered by Lakatos, that of post-formal 
proofs, consists of metamathematical arguments drawing conclusions about the meaning 
of what has been proved in a formal proof. We do not consider post-formal proof here; 
the meaning of the Kepler Conjecture seems intuitively clear. 


7 Formal Proof of the Kepler Conjecture 


Subsequent to the Hales-Ferguson proof, Hales embarked on a process of constructing a 
formal proof of the Kepler conjecture. This project is termed by Hales the FLYSPECK 
Project (FlysPecK is an acronym for Formal Proof of the Kepler conjecture). It is an 
ongoing project described in Hales et al. [31] (in this volume). 

It is a much more difficult problem to produce and verify a formally checkable proof 
than it is to provide a pre-formal proof. In practice the proofs provided in mathematical 
journals do not supply sufficient details to make the conversion to a formal proof a routine 
procedure. That is, a “pre-formal proof” in a mathematics journal can be regarded as 
somewhat analogous to a software specification in computer science that prescribes a 
recipe for writing software code, but which could conceivably lead to some surprises 
in the actual process of writing and testing code. Such conversion to a formal proof 
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requires a choice of axiom scheme and the axioms that are (often implicitly) assumed 
in the basic framework of the subject. The process of reducing to axiomatization often 
exposes hidden assumptions that are built into the proof. For a viewpoint firmly on the 
side of computers, see Zeilberger [60]. 

The Kepler Conjecture uses only elementary Euclidean geometry in its formulation 
and seems well suited to formalization of the proof. A detailed study of axiomatic foun- 
dations of geometry was undertaken by Hilbert [35], in a book first published in 1899. 
He noted that Euclid’s list of axioms was incomplete, proposed formal axioms for Eu- 
clidean geometry and related geometries, and studied the interdependence of various 
axiom schemes. This work led to some surprises. Some of Hilbert’s original axioms 
were found to be implied by others of his axioms; his book went through many editions 
and corrections; many appendices were added in later editions. In fact, a formal proof 
of the Kepler Conjecture necessarily requires going beyond the framework of Hilbert’s 
axioms because these do not cover the discrete nature of packings. Further work on 
axiomatization is needed. 

Constructing a formal proof of the Kepler Conjecture involves not only logic, it is 
also a giant software project. When completed, such a proof will represent the most 
complicated proof to date that is reduced to formalization. Details on this project are 
presented in the paper Hales et al. [31] (in this volume). 

As mentioned, formalization of a proof can also uncover gaps in a pre-formal proof. 
This was the case here. The initial process of formalizing the proof tree of the published 
proof in the six papers of the Kepler Conjecture did uncover one logical gap in the 
published proof. This gap is described, and filled, in the second part of paper [31] (in 
this volume). 


8 Applications of the Hales-Ferguson Proof Methodology 


The ideas behind the “scoring” method developed for the proof of the Kepler Conjecture 
have been adapted by T. C. Hales, with others, to solve other geometric optimization 
problems. 

Regarding the Voronoi tessellation, in 1943 Laszlo Fejes-Toth[14] proposed the fol- 
lowing conjecture: 


Dodecahedral Conjecture The volume of any Voronoi cell of a unit sphere packing 
in R? is not smaller than the volume of a regular dodecahedron circumscribed around 
the unit sphere. 


In 1998 Hales and S. McLaughlin [32] announced a computer-aided proof of the Do- 
decahedral conjecture, based on similar methods to those in the Hales-Ferguson proof of 
the Kepler conjecture. After a very long peer review process, and substantial rewriting, 
this paper recently appeared in the Journal of the American Mathematical Society [33]. 
Another very old problem concerns partitioning the plane into cells of equal area. 
How should the cells be designed to minimize the average perimeter of the cells? The 
conjectured answer is that each cell should be a regular hexagon, as in a honeybee 
comb. This conjecture was proved by Hales [22] in 2001, by establishing various new 
two-dimensional “local density” inequalities appropriate to the problem. 
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9 Contents of This Volume 


The remainder of this volume consists of a collection of papers reprinted from Discrete 
& Computational Geometry. These are grouped in four parts, as follows. 


PART I. Survey: Local Density Inequalities for Sphere Packing 


The next chapter of Part I reprints a paper of J. C. Lagarias [45] describing local 
density inequalities for giving upper bounds on the density of sphere packings in any 
dimension. This paper can serve as an introduction to the ideas of Hales and Ferguson; 
that is, it describes the particular local density inequality studied by Hales and Ferguson, 
as given in their original 1998 preprints. This paper was written in the middle of the 
review process of the Hales-Ferguson papers, and influenced reorganization of the the 
six papers presenting the proof that appeared in print after peer review (Part I). As it 
turned out, the proof given in the six papers in Part II made some further small modifi- 
cations and simplifications to the formulation of the local density inequality used in the 
1998 preprints, as explained by T. C. Hales in the first of the six papers presenting the 
published proof. 


PART ILI. Proof: Hales-Ferguson Proof of the Kepler Conjecture 


This part of the book reprints the detailed proof of the Kepler Conjecture given in the 
six papers by T. C. Hales with S. P. Ferguson that appeared in Discrete & Computational 
Geometry in 2006, and form the main body of this book. It also reproduces the editorial 
foreword of Gabor Fejes-Toth and Jeffrey C. Lagarias concerning the editing of this 
proof. 


The first of the six papers, “Historical overview of the Kepler Conjecture,’ authored 
by T. C. Hales, provides a historical overview of the problem and some remarks on the 
main features of the proof, and gives motivation for the decomposition of space and 
choice of “score function” used in the proof. 


The second of the six papers, “A formulation of the Kepler Conjecture,” co-authored 
by T. C. Hales and S. P. Ferguson, formulates the partition of space and the weights giving 
the local density “score function.” A primary feature of the partition of space is its use 
of Delaunay-like tetrahedra for the main part of the partition when these are of suitable 
shape (in the two optimal configurations only tetrahedra of this type appear); and then 
the use of Voronoi-like regions for the remaining part of the decomposition. The partition 
of space assigns a “‘decomposition star” to a sphere center, which comprises (weighted) 
volumes of regions in a series of polyhedral cones with apex at the sphere center. There 
are an enormous variety of such decomposition stars. The weighting scheme introduced 
for the tetrahedra is a very important part of the “score function.” This paper formulates 
the precise local density inequality that if proved, will establish Kepler’s conjecture as a 
corollary. 


The third of the six papers, “Sphere Packings III. Extremal cases,” authored by 
T. C. Hales, formulates preliminary geometric inequalities to the proof. It treats the two 
extremal cases where equality will hold in the inequality, corresponding to the two types 
of Voronoi cells | and 2 above. It shows that they are local maxima of the “score function.” 
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The fourth of the six papers, “Sphere packings IV. Detailed bounds,” authored by 
T. C. Hales, is the technical heart of the proof. The proof studies the structure of possible 
decomposition stars that might produce a counterexample, termed contravening decom- 
position stars. These would be decomposition stars with score exceeding a quantity 8 
pt, the “optimal” constant. (Here a “point” pt := 4tan~!(¥2) — = © 0.05537.) As- 
sociated to a decomposition star is a graph corresponding to intersecting the cones in 
the decomposition with a small sphere around the sphere center. A priori the graph of 
a decomposition star could have disconnected pieces, as well as have regions that are 
not convex, etc. The proof aims to show the set of graphs of contravening decomposi- 
tion stars is empty. As a step towards this, this paper shows the graphs associated to a 
contravening decomposition star must be very restricted in geometry. The regions in the 
associated graph are connected and polygonal with at most 8 sides, with various further 
restrictions concerning weights associated with the polygonal regions. It relies on many 
detailed computer calculations. 


The fifth of the sixth papers, “Sphere packings V. Pentahedral prisms,” authored by 
S. P. Ferguson, treats the case of a decomposition star of a particular shape, having a 
particular associated graph, called a “pentrahedral prism.” It proves that the score is 
strictly below the optimal value 8 pt in these cases. 


The sixth paper, “Sphere packings VI. Tame graphs and linear programs,” authored 
by T. C. Hales, deals with the large mass of possible contravening decomposition stars 
remaining from the fourth paper. The first part of this paper describes a family of abstract 
planar graphs called tame, having no faces of degree 9 or greater, and other properties. 
It classifies all such graphs, showing there are a few thousand of them, and then uses 
the results of the fourth paper, with minor modifications, to show that any contravening 
decomposition star must have associated graph a tame graph, according to this definition. 
The second part of this paper uses linear programming to obtain upper bounds on the 
score of decomposition stars having associated tame graphs. These computations rule out 
all tame graphs as contravening, with three exceptions. The exceptions are all tame graphs 
corresponding to the FCC Voronoi cell, the HCP Voronoi cell, and the pentahedral prism. 
These three cases were already treated in the third and fifth papers. The local density 
inequality follows. 


Strictly speaking, these six papers describe only the upper levels of the proof tree, 
since at the bottom lies a massive computer verification of many inequalities. References 
to computer verifications is provided in the papers. 


As remarked above, an abridged version of this proof, omitting details of proof for a 
result of S. Ferguson (given in the fifth paper above, [28]). was published by T. C. Hales 
[23] in 2005 in the Annals of Mathematics, the top mathematical journal. 


PART IIl. Revision: A revision of the proof of the Kepler Conjecture 


Work on the proof did not cease with its publication. This part contains the paper “A 
revision of the Proof of the Kepler Conjecture,’ authored by T. C. Hales, J. Harrison, 
S. McLaughlin, T. Nipkow, S. Obua, and R. Zumkeller [31], which appeared in 2010. 


The first part of this paper describes ongoing work to produce a formal proof of the 
Kepler Conjecture. This project is termed by Hales the FLYSPECK project. The goal 
is to provide a proof of the Kepler conjecture within a formal logical language, entirely 
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checkable by computer. The second part of the paper deals with the discovery of one 
gap in logic in the original proof, uncovered in the process of formalization. This part 
of the paper, written by T. C. Hales alone, fills in that gap in the original proof. At the 
end, this paper supplies a list of errata in the original proof in Discrete & Computational 
Geometry. The locations of these errata in the original proof are indicated by asterisks 
added in the margins of the papers in Part II of this volume. 


PART IV. Prehistory: Initial Hales papers on the Kepler Conjecture 


For historical purposes, this volume also includes two of T. C. Hales’s earlier papers 
in 1997 on the Kepler conjecture. These papers, “Sphere Packings I” [20] and and “Sphere 
Packings II” [21], give his original formulation of this research program to prove the 
Kepler conjecture. Later work required a modification of Hales’s original approach. In 
particular in the published proof the scoring function was given a new definition, slightly 
different from the one used in “Sphere Packings I” and “Sphere Packings II.” The final 
version of the six papers giving the proof above were written to be logically independent 
of the first two papers. However certain auxiliary results were first proved in “Sphere 
Packings I’ and “Sphere Packings II.” 


Acknowledgements I thank Gabor Fejes-Toth for editorial work on the DCG Special Issue in July 2006. I 
thank him, Thomas C. Hales, Steven G. Krantz, Richard Pollack, and K. Soundararajan for helpful comments 
on this paper. 
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Bounds for Local Density of Sphere Packings and the 
Kepler Conjecture, by J. C. Lagarias 


This paper presents a framework for local density inequalities giving upper bounds for 
sphere packing density in arbitrary dimensions. It discusses in detail such inequalities 
in the three-dimensional case, and the Kepler conjecture. 


Sections 4 and 5 of this paper outline the Hales-Ferguson proof as presented in 
their six 1998 preprints. The final version of the proof given in their 2006 papers, 
given in Part II, made further small changes from the outline here. In particular, their 
treatment of V-cells is modified, as described in Section 5.1 of the paper T. C. Hales 
and S. P. Ferguson, “A Formulation of the Kepler Conjecture”; see especially Re- 
marks 5.10 and 5.11. The modified treatment eliminates the need to handle “tips” as 
described in Definition 4.5 of this paper. 
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Abstract. This paper formalizes the local density inequality approach to getting upper 
bounds for sphere packing densities in R”. This approach was first suggested by L. Fejes 
T6th in 1953 as a method to prove the Kepler conjecture that the densest packing of unit 
spheres in R? has density z/ /18, which is attained by the “cannonball packing.” Local 
density inequalities give upper bounds for the sphere packing density formulated as an 
optimization problem of a nonlinear function over a compact set in a finite-dimensional 
Euclidean space. The approaches of Fejes Toth, of Hsiang, and of Hales to the Kepler 
conjecture are each based on (different) local density inequalities. Recently Hales, together 
with Ferguson, has presented extensive details carrying out a modified version of the Hales 
approach to prove the Kepler conjecture. We describe the particular local density inequality 
underlying the Hales and Ferguson approach to prove Kepler’s conjecture and sketch some 
features of their proof. 


1. Introduction 


The Kepler conjecture was stated by Kepler in 1611. It asserts that the face-centered 
cubic lattice gives the tightest possible packing of unit spheres in R*. 


Kepler Conjecture. Any packing Q of unit spheres in R? has upper packing density 


IU 
p(Q) < —— ~ 0.740480. (1.1) 
J18 


The definition of upper packing density is given in Section 2. The problem of proving 
the Kepler conjecture appears as part of Hilbert’s 18th problem, see [Hi]. 

There have been many attempts to prove Kepler’s conjecture. Since the 1950s these 
have been based on finding a local density inequality that gives a (sharp) upper bound 
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on the density. In recent years Hales has developed an approach for proving Kepler’s 
conjecture based on such an inequality, and in 1998 he announced a proof, completed 
with the aid of Ferguson, presented as a set of six preprints. The proof is computer- 
intensive, and involves checking over 5000 subproblems. It involves several new ideas 
which are indicated in Sections 4 and 5. 

Local density inequalities obtain upper bounds for the sphere packing constant via an 
auxiliary nonlinear optimization problem over a compact set of “local configurations.” 
They measure a “local density” in the neighborhood of each sphere center separately. The 
general approach to the Kepler conjecture is first to find a local optimization problem that 
actually attains the optimal bound 2/./18 (assuming that one exists), and then to prove 
it. This approach was first suggested in the early 1950s by Fejes Toth [FT1, pp. 174— 
181], who presented some evidence that an optimal local density inequality might exist 
in three dimensions. In 1993 Hsiang [Hs1] presented another candidate for an optimal 
inequality. 

The objects of this paper are: 


(i) To formulate local density inequalities for sphere packings in arbitrary dimension 
R", in sufficient generality to include the known candidates for optimal local 
inequalities in R?. 

(ii) To review the history of local density inequalities for three-dimensional sphere 
packing and the Kepler conjecture. 

(iii) To give a precise statement of the local density inequality considered in the 
Hales—Ferguson approach. 

(iv) To outline some features of the Hales—Ferguson proof. 


In Section 2 we present a general framework for local density inequalities, which is 
valid in IR”, given as Theorem 2.1. This framework is sufficient to cover the approaches 
of Fejes T6th, Hsiang, and Hales and Ferguson to Kepler’s conjecture. A different frame- 
work for local density inequalities appears in [O]. In Section 3 we review the history 
of work on local optimization inequalities for Kepler’s conjecture. In Section 4 we 
describe the precise local optimization problem formulated by Ferguson and Hales in 
[FH], which putatively attains 2//18. In Section 5 we remark on some details of the 
proof strategy taken in the papers of Hales [SP-I], [SP-II], [SP-I1], [SP-I'V], [KC], 
Ferguson and Hales [FH], and Ferguson [SP-V]. In Section 6 we make some concluding 
remarks. 

The current status of the Hales—Ferguson proof is that it appears to be sound. The 
proof has been examined in fairly careful detail by a team of reviewers, but it is so 
long and complicated that it seems difficult for any one person to check it. This pa- 
per is intended as an aid in understanding the overall structure of the Hales—Ferguson 
proof approach, as presented in the preprints. For another account of the Hales and 
Ferguson work, see [O]. For Hales’ own perspective, see [Ha4]. It may yet be some 
time until a version of the Hales—Ferguson proof is published; significant simplifica- 
tion may yet occur, and the local density inequality to be proved in the final version, 
which we may call “Version 2.0,” may differ in some details from the one described in 
Section 4. 

Two appendices are included which contain some information relevant to the Hales— 
Ferguson proof given in the preprints. Appendix A describes some of the Hales—Ferguson 
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scoring functions. Appendix B lists references in the Hales and Ferguson preprints for 
proofs of lemmas and theorems stated without proof in Sections 4 and 5. 
This paper is a slightly revised version of the manuscript [L]. 


Notation. B, := B,(0; 1) = {x € R”: ||x|]| < 1} is the unit n-sphere. It has volume 
C= al? | T(n/2 4+ 1), with k2 = mw and x3 = 47/3. We let C, (x, T) := x + [0, T]" 
denote an n-cube of sidelength 7, with sides parallel to the coordinate axes, and lowest 
corner at x € R”. 


2. Local Density Inequalities 


In this section we present a general formulation of local density inequalities. We first 
recall the standard definition of sphere packing densities, following Rogers [R2]. Let 
Q denote a set of unit sphere centers, so that ||v — v’|| > 2 for distinct v, v’ € Q. The 
associated sphere packing is Q + B,. 


Definition 2.1. 


(i) For a bounded region S in R”, and a sphere packing Q + B,, specified by the 
sphere centers Q2, the density p(S) = p(Q, S) of the packing in the region S is 


__ vol(SA(Q+B,)) 
p(S) = aes (2.1) 


(ii) For T > 0 the upper density p(Q2, T) is the maximum density of the packing Q 
over all cubes of size T, 1.e., 


P(Q,T) := sup p([0, T]” +x). (2.2) 


xeR3 


Then the upper packing density of Q is 
p(Q) := limsup p(Q, T). (2.3) 
T—0o 
(iii) The sphere packing density 5(B,,) of the ball B,, of unit radius is 


6(Bn) = nee p(&2). (2.4) 


Definition 2.2. A sphere packing Q is saturated if no new sphere centers can be added 
to it. 


To obtain sphere packing bounds it obviously suffices to study saturated sphere pack- 
ings, and in what follows we assume that all packings are saturated unless otherwise 
stated. 
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Definition 2.3. An admissible partition rule is a rule assigning to each saturated pack- 
ing Q in R” a collection of closed sets P(Q) := {Ra = R,(2)} with the following 
properties: 


(i) Partition. Each set Ry is a finite union of bounded convex polyhedra. The sets 
R,, cover R? and have pairwise disjoint interiors. 

(11) Locality. There is a positive constant C (independent of 2) such that each region 
Ry has 


diameter(R,) < C. (2.5) 


Each Ry is completely determined by the set of sphere centers w € Q with 
distance(w, Ry) < C. (2.6) 


There are at most C regions intersecting any cube of side 1. 
(iii) Translation-Invariance. The partition assigned to the translated packing Q! = 
Q +x consists of the sets {Ry (&2) + x}. 


Definition 2.4. An admissible weighting rule or admissible scoring rule o for an ad- 
missible partition rule in IR” is a rule which for each Q assigns to each pair (Ro, V) 
consisting of a region Ry € P(Q) and a center v € Q, a real weight o (Ry, v) which 
satisfies |o(Ra, v)| < C* for an absolute constant C*, and which has the following 
properties: 


(i) Weighted Density Average. There are positive constants A and B (independent 
of (2) such that for each set Ry, 


d oie. v) = (Ap(Ra) — B) vol(Ra), (2.7) 
veQ 

where 
P(Ra) vol(Ra) = vol(Ry 1 (Q + B,)) (2.8) 


measures the volume covered in Ry by the sphere packing Q with unit spheres. 

(ii) Locality. There is an absolute constant C (independent of {2) such that each 
value o(Ry, Vv) is completely determined by the set of sphere centers w € Q 
with ||w — v|| < C. Furthermore, 


o(Ry, Vv) =0 if dist(v, Ry) > C. (2.9) 


(iii) Translation-Invariance. The weight function o’ assigned to the translated pack- 
ing Q’ = Q +x satisfies 


o'(Ry + X,V +x) =a(Ry, V). (2.10) 


Note that this definition specifically allows negative weights. 
The “local density” is measured by the sum of the weights associated to a given vertex 
Vv in a saturated packing. 
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Definition 2.5. 


(i) The vertex D-star (or decomposition star) D(v) at a vertex v € Q consists of all 
sets Ry € P(Q) such that o (Ry, v) £ 0. 
(ii) The total score assigned to a vertex D-star D(v) at v € Q is 


Score(D(v)) = > o(Ry, V). (2.11) 
R,€D(v) 


The total score at v depends only on regions entirely contained within distance C of 
v. Any admissible partition and weight function (P, 0) together yield a local inequality 
for the density of sphere packings, as follows. 


Theorem 2.1. Given an admissible partition rule and an admissible weighting rule 
(P, 0) for saturated packings in R", set 


06=0p,,(A, B):= sup (sup Score(D(v))), (2.12) 


Q saturated vEQ 


and suppose that 9 < k,A, where k, is the volume of the unit n-sphere. Then the 
maximum sphere packing density satisfies 


K, 


Remarks. (1) We let f(A, B, 0) := k,B/(k,A—@) denote the packing density bound 
as a function of the score constants A and B. The sphere packing density bound actually 
depends only on the score constant ratio B/ A, rather than on B and A separately, since 0 
is ahomogeneous linear function of A and B. This ratio detemines the relative weighting 
of covered and uncovered volume used in the inequality. 

(2) A natural approach to sphere packing bounds, used in many previous upper bounds, 
is to partition space into pieces R(v) corresponding to each sphere center v, with each 
piece containing the unit sphere around v. Then one obtains a bound p(Q) < 6 by 
establishing an upper bound 


p(R(v)) = all veg. (2.14) 


vol(R(v)) <9, 


Any optimal sphere packing bound of this type must necessarily be volume-independent, 
in the sense that if equality is to be attained at all local cells R(v) simultaneously, then 
they must all have the same volume. In contrast the inequality of Theorem 2.1 does take 
into account the volumes of the individual pieces in the vertex D-star. This flexibility 
results in a larger class of local density inequalities. The idea of using a score constant 
ratio B/A not equal to the optimal packing density is due to Hales [Hal], [Ha2], cf. 
Appendix B. 


Proof of Theorem 2.1. We may assume that Q is saturated. Given T > Oandanye > 0, 
we choose a point x € R” which attains the density bound o(Q, T) on the cube C,, (x, T) 
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to within ¢. We evaluate the scores of all vertex D-stars of vertices v € QQ C,,(x, T) in 
two ways. First, by definition of 0, 


> Score(D(v)) < 6#|2N Cy (x, T)]. (2.15) 
veQnc, (x,T) 


However, we also have 


> Score(D(v)) 


> (x o(Ra, ») 
ve Qn, (x,T) ve Qn, (x,T) a 
= > ( = v(t.) + or") 


RacCa(X.7) veQnC, (x,T) 
= D2 Cp(Ra) — B) vol(Ry) + O(T"™™!) 
Ra SC, (x; T) 
= K, A#|QN C,(x, T)| — B vol(C, (x, T)) + O(T""!) 
= Kk, A#|QNC,(x, T)| — BT’ + O(T"}). (2.16) 


Here we use that fact that {R,,} partitions R”, so covers the cube, and the O(T"~') error 
terms above occur because the counting is not perfect within a constant distance C of 
the boundary of the cube. Combining these evaluations yields 


(kn A — 0)#|20C, (x, T)| < BT” + O(T""). 


If 0 < x,A, then we can rewrite this as 


#(Q.N C,, (x, T)| B 1 
< +0 : (2.17) 
T* K,A —@ 


By assumption 


a ae vol(C(x, T) N(Q + B)) 


T3 
#/Q.0 C(x, T 1 
5 | (x, T)| +0 ( ) 
T” T 
Together with (2.17), this yields 
5(2,T) -e< “2 _+0 : (2.18) 
; € ; _ 
ss = nA — 0 T 


with an O-symbol constant independent of ¢. Letting e — O and then T — o& gives 
the inequality for ({2). Since this holds for all saturated packings the result follows. 


Determining the quantity @p,,(A, B) for fixed A, B can be viewed as a nonlinear 
optimization problem over a compact set. The translation-invariance property of (P, a) 
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allows the supremum (2.12) to be taken over the smaller set with v = 0 and admissible 
Q containing 0. The locality property shows that the vertex D-star at 0 is completely 
determined by w € & with ||w|| < C. The set of such configurations of nearby sphere 
centers forms a compact set in the Euclidean topology. Actually the partition and weight 
functions may be discontinuous functions of the locations of sphere centers, so the 
optimization problem above is not genuinely over a compact set. One must compactify 
the space of allowable vertex D-stars by allowing some sets of sphere centers to be 
assigned more than one possible vertex D-star. In practical cases there will be a finite 
set of types of vertex D-stars, hence a finite upper bound on the number of possibilities. 


Definition 2.6. A local density inequality in R” is optimal if 
f(A, B, OP) = 6(B,). (2.19) 


Optimal local density inequalities exist in one and two dimensions. In discussing the 
three-dimensional case, we presume that 5(B3) = 2/./18, so that an optimal density 
inequality in R? will refer to one achieving this value. The evidence indicates that there are 
many different possible optimal local density inequalities in three dimensions, including 
that of the Hales and Ferguson proof. 

There are currently four candidates for local density inequalities that may be optimal 
in three dimensions. The first is that of Fejes Toth, described in Section 3, which uses 
averages over Voronoi domains, in which the score constant ratio B/A = 2/4/18. The 
second is that of Hsiang [Hs1], which is a modification of the Fejes Toth averaging, 
and uses the same score constant ratio. The third is due to Hales [SP-I], and is based on 
the Delaunay triangulation, using a modified scoring rule described in Section 3. The 
fourth is that given by Ferguson and Hales [FH], and uses a combination of Voronoi- 
type domains and Delaunay-type simplices, with a complicated scoring rule, described 
in Section 4. In the latter two cases the score constant ratio is 


B 30 +12 V3 
Py Y3) 0.720903. 


A V8 
In all four cases the compact set of local configurations to be searched has very high 
dimension. Each sphere center has three degrees of freedom, and the number of sphere 
centers involved in these methods to determine a vertex D-star seems to be around 50, 
so the search space consists of components of dimension up to roughly 150. 

It is unknown whether optimal local density inequalities exist for the sphere packing 
problem in R” in any dimension n > 4. In dimensions 4, 8, and 24 it seems plausible 
that the minimal volume Voronoi cell in any sphere packing actually occurs in the 
densest lattice packing. If so, the Voronoi cell decomposition would yield an optimal 
local inequality in these dimensions, and the densest packing would be a lattice packing 
in these dimensions. Another question asks: in which dimensions is the maximal sphere 
packing density attained by a sphere packing whose centers form a finite number of 
cosets of an n-dimensional lattice? Perhaps in such dimensions an optimal local density 
inequality exists. 

The state of the art in sphere packings in dimensions 4 and above is given in [CS] and 
[Z]. Apart from this, a very interesting new approach giving upper bounds for the density 


ee) 
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of sphere packings appears in the thesis of Henry Cohn [C]. His bounds are nearly sharp 
in dimensions 8 and 24, and strongly suggest that the E's and Leech lattice Az4 packings 
are optimal in these dimensions, respectively. 


3. History 


We survey results on local density inequalities in three dimensions. The work on local 
density bounds was originally based on two partitions of R* associated to a set Q of 
sphere centers: the Voronoi tesselation and the Delaunay triangulation. Since they will 
play an important role, we recall their definitions. 


Definition 3.1. The Voronoi domain (or Voronoi cell) of v € Q2 is 
VWoor(V) = Wor(v, &2) = {x € R?: |x — v|| < ||x — w|| for all w € Q}. (3.1) 


The Voronoi tesselation for Q is the set of Voronoi domains {Vyo,(v): v € Q}. 


The Voronoi tesselation is a partition of space up to boundaries of measure zero. If Q 
is a saturated sphere packing, then all Voronoi domains are compact sets with diameter 
bounded by 4. Indeed, for a saturated packing the Voronoi domain is contained in the 
sphere of radius 2 centered at v, as these spheres form a covering. 


Definition 3.2. The Delaunay triangulation associated to a set Q is dual to the Voronoi 
tesselation. It contains an edge between every pair of vertices that have Voronoi domains 
that share a common face. Suppose now that the points of Q are in general position, which 
means that each corner of a Voronoi domain has exactly four incident Voronoi domains. 
In this case these Voronoi domains have between them four faces that touch this corner, 
and these faces in turn determine (four edges of) a Delaunay simplex. The resulting 
Delaunay simplices partition R* and make up the Delaunay triangulation. In the case of 
non-general position Q the Delaunay triangulation is not unique. The possible Delaunay 
triangulations are determined locally as limiting cases of general position points. (There 
are only finitely many triangulations possible in any bounded region of space.) 


All the simplices in a Delaunay triangulation have vertices vj € © and contain no 
other point v € Q. We define, more generally: 


Definition 3.3. A D-simplex (or Delaunay-type simplex) for Q is any tetrahedron T 
with vertices v;, V2, V3, V4 € Q such that no other v € Q is in the closure of T. We 
denote it D(v,, Vo, V3, V4). 


In the literature a Delaunay simplex associated to a point set Q is any simplex with 
vertices in Q whose circumscribing sphere contains no other vertex of Q in its interior. 
All simplices in a Delaunay triangulation of Q are Delaunay simplices in this sense, so 
are necessarily D-simplices, but the converse need not hold. 

The admissible partitions that have been seriously studied all consist of a domain 
Vv) associated to each vertex v € V, which we call a V-cell, together with a collection 
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of certain D-simplices D(vj, V2, V3, V4) which we call the D-system of the partition. We 
use the term D-set to refer to a D-simplex included in the D-system. We note that a 
V-cell may consist of several polyhedral pieces, and may even be disconnected. 

The original approach of Fejes Toth to getting local upper bounds for sphere packing 
in R? used the Voronoi tesselation associated to Q. If Q is a saturated packing, then each 
Voronoi domain V,o;(v) is a bounded polyhedron consisting of points within distance at 
most 4 of v. Examples are known of Voronoi domains in a saturated packing that have 
44 faces; an upper bound for the number of faces of a Voronoi domain of a saturated 
packing is 49. The Voronoi partition takes the V-sets V (v) to be the Voronoi domains of 
v, with no D-sets, and the vertex D-star D,o-(Vv) is just Vio,(v). A Voronoi scoring rule is 


Score(Dyor(v)) -= (e(V(v)) — B) vol(V(v)), (3.2) 


with the score constants A = | and B is to be chosen optimally. Such scoring rules 
are admissible. However, it has long been known that no Voronoi scoring rule gives an 
optimal inequality (2.19). The dodecahedral conjecture states that the maximum packing 
density of a Voronoi domain is attained for a local configuration of 12 spheres touching 
at the center of faces of a circumscribed regular dodecahedron. 


Dodecahedral Conjecture. For a Voronoi domain V,o,(v) of a unit sphere packing, 


ma 
p(Vrec)) © 7561 — Seats) tanta yay ~ 0794897 (3.3) 


and equality is attained for the dodecahedral configuration. 


A proof of the dodecahedral conjecture has been announced by Hales and McLaughlin 
[Dod], based on similar ideas to the Hales’ approach to the Kepler conjecture. 

In 1953 Fejes Toth [FT1] proposed that an optimal inequality might exist based on 
a weighted averaging over Voronoi domains near a given sphere center, and in 1964 he 
made a specific proposal for such an optimal inequality. In the notation of this paper he 
used the Voronoi partition and an admissible scoring function of the form 


0 (Vyor(W), V) == @(W, V){(Ap(Vyor(w)) — B) vol(Vvor(w))}, (3.4) 
in which A = 1, B = 2/18, and the weights w(w, v) are given by 


1 . 
Pe ee ee if 2<|lw—vil_<2+¢, (3.5) 
0 if |Iw—vlj|>2+42, 


and 
ov, v) = 1— >) a(w,y). (3.6) 
wv 
Here ¢ > 0 is a fixed constant. The resulting inequality in Theorem 2.1 is optimal if 
the associated constant @p.,,(A, B) = 0. The Fejes-Téth scoring function corresponds 


to a weighted averaging over the spheres touching a central sphere at v, where the 
weight assigned to the central sphere depends on how many spheres touch it. Fejes Téth 
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considered choosing ¢ as large as possible consistent with requiring that w(v, v) > 0, 
which is equivalent to requiring that it is impossible to pack 13 spheres around a given 
sphere, with all 13 sphere centers within distance 2 + t of the center of the given sphere. 
In 1964 he suggested [FT2, p. 299] that one could take the value t = 0.0534, and we 
consider this to be Fejes Toth’s candidate for an optimal inequality. It has not been 
demonstrated that w(v, v) > O holds for this value of f, but note that the argument of 
Theorem 2.1 is valid for any value of t, even if some w(v, v) are negative. The only issue 
is whether the resulting sphere packing bound is optimal. Fejes Toth [FT2] explicitly 
noted that establishing an optimal inequality, if it is true, reduces the problem in principle 
to one in a finite number of variables, possibly amenable to solution by computer. 

In 1993 Hsiang [Hs1] studied a variant of the Fejes Téth approach. He used the 
Voronoi partition and an admissible scoring function! of the form (3.4), with the same 
A= 1, B=2/¥18, and with the weights w(w, v) > 0 given by 


1 


maieliage oO 100" (3.7) 
0 if |w—vil > 3, 
where 
N(v) = #{w € Q: 0 < |lw—vl] < 28} (3.8) 


counts the number of “near neighbors” of v. Hsiang announced that his local inequality 
is optimal (with @p,,,(A, B) = 0), and that he had proved it, which would then con- 
stitute a proof of Kepler’s conjecture. However, the proof of optimality presented in 
[Hs1] is regarded as incomplete by the mathematical community, see G. Fejes Téth’s 
review of Hsiang’s paper in Mathematical Reviews, the critique in [Ha3], and Hsiang’s 
rejoinder [Hs2]. 

In 1992 Hales [Hal], [Ha2] studied the Delaunay triangulation, which partitions R 
into D-sets. There are a finite number of (local) choices for Delaunay triangulations 
of a neighborhood of a fixed v € 2. Hales used the following function in defining his 
associated weight function. 


Definition 3.4. The compression of a finite region R in R? with respect to a sphere 
packing Q is 
P(R) = (P(R) — doct) vol(R) (3.9) 
in which 
—3 12 1/V3 
sje tet ators) yaanous (3.10) 
/8 


is the fraction of the volume of the regular octahedron of sidelength 2 covered by unit 
spheres centered at its vertices. 


' More accurately, the locally averaged density in Section 3 of [Hs1] is converted to the form given in 
Section 2 by clearing denominators, and cancelling out Hsiang’s factor of 13. Note also that each Voronoi 
domain contains exactly one sphere, so that p(V(V)) vol(V(V)) = 42/3. 
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Hales initially considered the admissible weight function 
o (D(¥1, V2, V3, V4), Vi) = P(D(M1, V2, V3, V4))- (3.11) 


The vertex D-star of v consists of all the simplices in the Delaunay triangulation that 
have v as a vertex; we call this set of simplices the Delaunay D-star Dpe\(v) at v. Hales 
used score constants A = 4 and B = 46oct, with A = 4 used since each simplex is 
counted four times. However, he discovered that the pentagonal prism attained a score 
value exceeding what is needed to prove Kepler’s conjecture. The pentagonal prism is 
conjectured to be extremal for this score function. 

The fact that the (conjectured) extremal configurations for the Voronoi tesselation and 
Delaunay triangulation do not coincide suggested to Hales that a hybrid scoring rule be 
considered that combines the best features of the Voronoi and Delaunay scoring function. 
In 1997 Hales again considered a Delaunay triangulation, but modified the scoring rule 
to depend on the shape of the D-simplex D(v,, V2, V3, V4). For some simplices he used 
the weight function above, while for others he cut the simplex into four pieces, one for 
each vertex, called the pieces V(D, v;), and assigned the weights” 


ao (D(v}, V2, V3, V4), Vi) = 40 (V(D, v;)), 


for 1 < i < 4. He also partitioned a vertex D-star into pieces called “clusters” whose 
score functions could be evaluated separately and added up to get the total score. Each 
“cluster” is a finite union of Delaunay simplices filling up that part of the vertex D- 
star at v lying in a pointed cone with vertex v. This vertex cone subdivision facilitates 
computer-aided proofs by decomposing the problem into smaller subproblems. Hales 
[SP-I], [SP-II] presented evidence that this modified scoring function satisfies an optimal 
local inequality. He showed that the two known local extremal configurations? gave local 
maxima of the score of the Delaunay D-star in the configuration space. The associated 
scoring function is 


Score(Dpei(V)) = 8pt, (3.12) 


where 


llz 1 
pt := —— — 12arccos (=) ~ 0.0553736, (3.13) 
3 V3 
which is the desired optimal value. He also showed that this was a global upper bound 
over the subset of configurations described by a vertex map* G(v) that is triangulated. 
Hales [SP-I, Conjecture 2.2] conjectured that the modified score function achieved the 
optimal inequality 


Score(Dpei(V)) < 8pt, (3.14) 


? More precisely, he used here the “analytic continuation” of this scoring function that is described in 
Appendix A. 

3 These correspond to Voronoi cells being the rhombic dodecahedron or trapezo-rhombic dodecahedron 
on p. 295 of [FT2]. 

4 See Section 5 for a definition of vertex map G(v). 
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for all Delaunay D-stars Dpei(v). However, he and Ferguson (his student) [FH] discov- 
ered that a pentagonal prism configuration comes very close to violating the inequality 
(3.14). Furthermore, there turned out to be many similar difficult configurations which 
might possibly violate the inequality. These and other difficulties indicated that it was 
not numerically feasible to prove (3.14) by a computer proof, assuming that (3.14) is 
actually true. 

Hales and Ferguson together then further modified both the partition rule P and the 
scoring rule o to obtain a rule with the following properties: 


(i) It makes the score inequality stronger on the known bad cases related to the 
pentagonal prism configuration. 

(ii) It uses a more complicated notion of “cluster,” which includes Voronoi pieces as 
well as D-sets, and which retains the “decoupling” property that it is completely 
determined by vertices of Q in the cone above it. 

(111) It chooses a scoring rule which when combined with “truncation” on clusters 
is still strong enough to rule out most configurations. The “truncation” opera- 
tion greatly reduces the number of configurations to be checked, at the cost of 
weakening the inequality to be proved. 


In Section 4 we give a precise description of the Hales—Ferguson rules (P, o). 


4. Hales—Ferguson Partition Rule and Score Function 


Ferguson and Hales [FH] use the following partition rule and scoring rule. The partition 
uses two types of D-simplices, with a complicated rule for picking which ones to include 
as D-sets in the partition. Modified Voronoi domains V(v) are used as V-sets. These 
differ from the usual Voronoi domain (with the D-sets removed) by mutually exchanging 
some regions called “tips.” The scoring rule is also complicated: the weight function used 
ona D-simplex no longer depends on just its shape, but depends on the structure of nearby 
D-sets. 
We begin by defining the two types of D-simplices. 


Definition 4.1. A QR-tetrahedron (or quasi-regular tetrahedron) is any tetrahedron 


; riper 251 
with all vertices in Q and all edges of length < 7. 


Definition 4.2. A QL-tetrahedron (or quarter) is any tetrahedron with all vertices in 


Q and five edges of length < ot and one edge with length ot < 1 < 2,/2. The long 


edge is called the spine (or diagonal) of the QL-tetrahedron. 


For some purposes? the case of a spine of length exactly ot should be considered 


as either a QLl-tetrahedron or a QR-tetrahedron. Here we treat it exclusively as a QR- 
tetrahedron. 


5 In proving inequalities one wants to work on a compact set. In compactifying the space of configurations, 
this requires allowing the lower inequality in the definition of the QL-tetrahedron to be an equality. 
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Neither kind of tetrahedron is guaranteed to be included in the Delaunay triangulation 
of Q, but we do have: 


Lemma 4.1. All QR-tetrahedra and QL-tetrahedra are D-simplices. 


The Hales—Ferguson partition rule starts by selecting which D-simplices to include 
in the D-system. These consist of: 


(i) All QR-tetrahedra. 

(ii) Some QL-tetrahedra. The QL-tetrahedra included in the partition satisfy the com- 
mon spine condition which states that for a given spine, either all QL-tetrahedra 
having that spine are included or none are. 


This collection of tetrahedra is required (by the definition of an admissible partition) to 
form a non-overlapping set, where we say that two sets S; and Sj overlap if SM Sp has 
positive Lebesgue measure in R*. To show this is possible with (i) holding we have: 


Lemma 4.2. No two QR-tetrahedra overlap. 


QL-tetrahedra may overlap QR-tetrahedra or other QL-tetrahedra, hence one needs 
a rule for deciding which QL-tetrahedra to include. To begin with, QL-tetrahedra can 
overlap QR-tetrahedra in essentially one way. 


Lemma 4.3. [fa QL-tetrahedron and QR-tetrahedron overlap, then the QR-tetrahedron 
has a common face with an adjacent QR-tetrahedron, and the two unshared vertices of 
these QR-tetrahedra are the endpoints of the spine of the QL-tetrahedron. The union of 
these two QR-tetrahedra can be partitioned into three QL-tetrahedra having the given 
spine, which includes the given QL-tetrahedron. Aside from these QL-tetrahedra, no 
other QL-tetrahedron overlaps either of these two QR-tetrahedra. 


This lemma shows that the QL-tetrahedra having a given spine have the property that 
either all of them or none of them overlap the set of QR-tetrahedra. We next consider 
how QL-tetrahedra can overlap other QL-tetrahedra. The following configuration plays 
an important role. 


Definition 4.3. A Q-octahedron is an octahedron whose six vertices v; € (2 and whose 


251 
twelve edges each have lengths 2 </ < 7. 


A Q-octahedron has three interior diagonals. If a diagonal has length 2 </ < 7, 


then it partitions the Q-octahedron into four QR-tetrahedra. If a diagonal has length 


ot <1 < 2¥2, then it partitions the Q-octahedron into four QL-tetrahedra of which 


it is the common spine. If a diagonal has length 7 > 2/2 it yields no partition. A Q- 
octahedron thus gives between zero and three different partitions into four OR-tetrahedra 
or QL-tetrahedra. We call it a live Q-octahedron if it has at least one such partition. 
Lemma 4.3 implies that if it has a partition into QR-tetrahedra, then it has no other 
partition into OR-tetrahedra or QL-tetrahedra. 
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Lemma 4.4. A QL-tetrahedron having a spine which is a diagonal of a Q-octahedron 
does not overlap any QL-tetrahedron whose spine is not a diagonal of the same Q- 
octahedron. 


The selection rules for choosing QL-tetrahedra to include in the D-system require 
that one either includes all QL-tetrahedra having the same spine or else none of them. 
Thus the selection rule really specifies which spines to include. 


Definition 4.4. Consider an edge [v, v2] with v,, V2 € Q and on < |/Vi—Vvo|| < 24/2. 
A vertex w € Q is called an anchor of the edge [v,, va] if 
Iw-vill< 2 for i=1,2. 

Ferguson and Hales use the number of QL-tetrahedra having a spine [v;, v2] and 
the number of anchors of that spine in deciding which QL-tetrahedra to include in the 
D-system. Call a QL-tetrahedron isolated if it is the only QL-tetrahedron on its spine 
[v,, V2]. The inclusion rule for an isolated QL-tetrahedron is: 


(QLO) An isolated QL-tetrahedron is included in the D-system if and only if it 
overlaps® no other QL-tetrahedron or QR-tetrahedron. 


Next consider spines [v;, V2] which have two or more associated QL-tetrahedra. Such 
spines have at least three anchors, and the inclusion rules are: 


(QL1) Each non-isolated QL-tetrahedron on a spine with five or more anchors is 
included in the D-system. 

(QL2) Each non-isolated QL-tetrahedron on a spine with four anchors is included in 
the D-system, if the spine is not a diagonal of some Q-octahedron. In the case 
of a live Q-octahedron, we include all QL-tetrahedra having one particular 
diagonal, and exclude all QL-tetrahedra on other diagonals. For definiteness, 
we choose the spine to be the shortest diagonal. In the case of a tie for the 
shortest diagonal, a suitable tie-breaking rule is used. 

(QL3) Each non-isolated QL-tetrahedron on a spine with three anchors is included in 
the D-system if each QL-tetrahedron on the spine does not overlap any other 
QL-tetrahedron or QR-tetrahedron, or overlaps only isolated QL-tetrahedra. 
It is excluded from the D-system if some tetrahedron on the spine overlaps 
either a QR-tetrahedron or a non-isolated QL-tetrahedron having four or more 
anchors. Finally, if some tetrahedron on the spine overlaps a non-isolated QL- 
tetrahedron having exactly three anchors, then the spine of the overlapped set 
is unique, and exactly one of these two sets of non-isolated QL-tetrahedra with 
three anchors is to be included in the D-system, according to a tie-breaking 
rule.’ 


© Tt cannot overlap a QR-tetrahedron by Lemma 4.3. 

7 The tie-breaking rule could be to include the spine with the lowest endpoint using a lexicographic ordering 
of points in R?. It appears to me that Hales would permit an arbitrary choice of which one to include, see 
Lemma 4.5 (iii). 
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The set of QL-tetrahedra selected above are pairwise disjoint and are disjoint from all 
QR-tetrahedra. This is justified by the following lemma. 


Lemma 4.5. 


(i) If two QL-tetrahedra overlap, then at most one of them has five or more 
anchors. 

(ii) If two overlapping QL-tetrahedra each have four anchors, then their spines are 
(distinct) diagonals of some Q-octahedron. 

(iii) If a non-isolated QL-tetrahedron with three anchors overlaps another QL- 
tetrahedron having three anchors, then each of their spines contains exactly 
two non-isolated QL-tetrahedra, and these four QL-tetrahedra overlap no other 
QL-tetrahedron or QR-tetrahedron. 


We call the set of QR-tetrahedra and QL-tetrahedra selected as above the Hales— 
Ferguson D-system. (Hales and Ferguson call this a Q-system.) 

We now define the V-cells of the Hales—Ferguson partition. To begin, we take the 
Voronoi domain V,o;(v) at vertex v and remove from it the union of all D-simplices in the 
D-system to obtain a reduced Voronoi region V,eq(v). Next we move certain regions of 
Vrea(v) called “tips” to neighboring reduced Voronoi regions to obtain modified regions 
Vmoa(Vv) and finally we define the V-cell V (vy) at v to be the closure of Vinoa(V). 


Definition 4.5. Let T be any tetrahedron such that the center x = x(T) of its circum- 
scribing sphere lies outside T. A vertex v of T is negative if the plane H determined 
by the face F of T opposite v separates v from x. The “tip” A(T, v) of T associated 
to a negative vertex v is that part of the Voronoi region of v with respect to the points 
{V1, Vo, V3, V4} that lies in the closed half-plane H* determined by H that contains x. 
The “tip” region A(T, v) does not overlap 7, and is a tetrahedron having x as a vertex, 
and has three other vertices lying on H. 


Lemma 4.6. 


(i) A QR-tetrahedron or QL-tetrahedron T has at most one negative vertex. 
(ii) Ifa negative vertex is present, then the three vertices of the associated “tip” that 
lie on H actually lie in the face of T opposite to the negative vertex. 
(iii) The “tip” of any tetrahedron in the Hales—Ferguson D-system either does not 
overlap any D-simplex in the Hales—Ferguson D-system, or else is entirely 
contained in the union of the D-simplices in the D-system. 


We say that a “tip” that does not overlap any D-set is uncovered. The lemma shows 
that uncovered “tips” lie in the union of the Voronoi regions {V;eq(w): w € QQ}, so that 
rearrangement of uncovered “tips” is legal. There is an a priori possibility that two “tips” 
may overlap® each other. 


8 T do not know if this possibility can occur. 
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1 ; 
9 (Vo + V1) 1 (vo + v3) 


vo 


Fig. 1. “Tip” of D-simplex [v1, v2, v3]. 


Uncovered Tip Rearrangement Rule. Each y € R?® that belongs to an uncovered 
“tip” is reassigned to the nearest vertex w € Q such that y is not in an uncovered “tip” of 
any pair (T, w) where T is in the Hales—Ferguson D-system and w is a negative vertex 
of T. (A tie-breaking rule is used if the two nearest vertices w are equidistant.) 


This rule cuts an uncovered “tip” into a finite number of polyhedral pieces and reas- 
signs the pieces to different reduced Voronoi regions. This prescribes how Vimoa(V¥) is 
constructed, and thus defines the Hales—Ferguson V-cells V(v). 

A two-dimensional analogue of a “tip” is pictured? in Fig. 1. In this figure the triangle 
T = [%1, V2, V3] plays the role of a D-simplex, with v2 as a negative vertex and the “tip” 
is the shaded region. The points ¢g12, €913, C)23 are centroids of the triangles determined 
by the corresponding v,’s. The shaded triangle [¢o12, €o13, C123] is in the Voronoi cell 
Vyor(Vo) while the remainder of the “tip” is in the Voronoi cell Vyor (V2). The uncovered tip 
rearrangement rule partitions the part in V,o, (V2) into three triangles which are reassigned 
to the V-cells V(vo), V(v;), and V (v3), e.g., [¥1, y2, Coi2] is reassigned to V(v,). The 
reassignment of the “tip” ensures that the pointed cone over v2 generated by the D- 
simplex [V;, V2, V3] does not contain any part of the V-cell at v2. In this example the 
V-cell at Vo does not feel the effect of the vertex v2, due to the rearrangement. 

We now turn to the Hales—Ferguson scoring rules. These use the compression function 
I'(S) given in (3.9). The compression function is additive: If S = S$; U S> is a partition, 


° See Figure 2.1 of Hales [SP-II] for another example. 
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then 
P(S) = P(S1) + P(S2). (4.1) 
For a D-simplex T, 
4 ‘ 
(solid angle); 
vol(T) p(T) = > ee (4.2) 
i=l 
where a full solid angle is 477. 
The Hales—Ferguson weight function for a V-cell is as follows: 
(S1) Fora V-cell V(v), 
41 (V(v)) if v=w, 
our (V (Vv), W) = (4.3) 


0 if vAw. 


We next consider the weight function for D-sets. Let (T, v) denote a D-simplex 
together with a vertex v of it. 


Definition 4.6. The Voronoi measure vor(T, Vv) is defined as follows. If the center of 
the circumscribing sphere of T lies inside T, then T is partitioned into four pieces: 


Voor(T, Vi) = {x € T: |Ix—vill < Ix—vjll for 1 <j < 4} 


vor 
and then 


vor(T, v) := (Vt (T,v)). (4.4) 


vor 


There is an analytic formula for the right side of (4.4) given in Appendix A, and this 
formula is used to define vor (7, v) in cases where the circumcenter falls outside T. 


In cases where the circumcenter is outside 7, and v is a negative vertex, then 
vor(T, Vv) = V(Yvor(T, v) U “tip”) (4.5) 
while for the other three vertices parts of the “tip” are counted with a negative weight, 


in such a way that 


4 


> vor(T, v;) = 4T(T) (4.6) 


i=l 
holds in all cases. The weight function for a D-set is given as follows: 


(S2) For a QR-tetrahedron T = D(v}, V2, V3, V4) in the D-system, 


T(T) if the circumradius of T is at most im 
our(T, v) = é ‘ : 141 (4.7) 
vor(T, Vv) if the circumradius of T exceeds io0° 
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The QL-tetrahedron scoring function is complicated. For a QL-tetrahedron T, let 7*(T) 
be the maximum of the circumradii of the two triangular faces of T adjacent to the spine 
of T, and define the function 

(7) if nt(T) < V2, 
vor(T,v) if nt(T) > V2. 


u(T, Vv) = (4.8) 


Then the QL-tetrahedron scoring function is defined by: 
(S3) (“Flat quarter” case) For a QL-tetrahedron T and a vertex v not on its spine, 
our(T, Vv) := LCT, V). (4.9) 


(S4) (“Upright quarter” case) For a QL-tetrahedron T with vertex v on its spine, let V 
denote the opposite vertex on the spine. If T is an isolated QL-tetrahedron, set 


oOur(T, Vv) := WT, VY). (4.10) 
If T is part of a Q-octahedron, set 
our(T, v) = 5(u(T, v) + W(T, ¥)). (4.11) 
In all other cases, set 
our(T, v) = 3(M(T, v) + M(T, 9) + 3(voro(T, v) — voro(T,%)), (4.12) 


in which voro(T, v) is a “truncated Voronoi measure” that only counts volume 
within radius 5 (3) of vertex v, which is defined in Appendix A, and on pp. 9-11 
of [FH]. 


The scoring rule (S4) is the most complicated one. In it the definition (4.11) plays 
an important role in obtaining good bounds for the pentagonal prism case treated in 
[SP-V], while the definition (4.12) is important in analyzing general configurations 
using truncation in [SP-IV]. 


Theorem 4.1. The Hales—Ferguson partition rule and scoring rule (Pyp, Op) are both 
admissible, with score constants A = 4 and B = 46oc- 


Proof. It is easy to verify that the definitions for scoring QR-tetrahedra and QL- 
tetrahedra satisfy the weighted density average property 


4 
\- our(T, vi) = 40 (7), (4.13) 


i=1 


which correspond to A = 4 and B = 46oct, using (4.6). Most of the remaining admis- 
sibility conditions are verified by Lemmas 4.1-4.6 except for locality. For locality, a 
conservative estimate indicates that the rules for removing and adding “tips” to deter- 
mine the V-cell V(v) are determined by sphere centers w € Q with ||w — v|| < 124/2. 
Finally the score function on the D-simplices is determined by vertices within distance 
6/2 of v. O 
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Based on this result, Theorem 2.1 associates to (Pur, OF) a sphere-packing bound; 
the Hales program asserts this bound will be optimal. To establish the Kepler bound 


a 
pli) = ——, (4.14) 
V18 
via (2.15), one must prove that 
8 := OP yp oup (4, 45 oct) = 8 pt, (4.15) 
where 
t ae 12 ( : ) 0.0553736 (4.16) 
:= —— — 12arccos | — } ~ 0. : : 
7 3 V3 


The score function Score(Dyp(v)) is discontinuous as a function of the sphere centers 
in Q near v, because it is a sum of contributions of pieces which may appear and disappear 
as sphere centers move, and discontinuities occur when QL-tetrahedra convert to QR- 
tetrahedra. To deal with this, one compactifies the configuration space by allowing some 
sphere center configurations to have more than one legal decomposition into pieces (but 
at most finitely many). The optimization problem can then be split into a finite number 
of subproblems on each of which oyp is continuous. 

The complexity of the definition of (Pur, our) is designed to yield a computation- 
ally tractable nonlinear optimization problem. The introduction of QL-tetrahedra and 
the complicated score function on them is designed to help get good bounds for the 
pentagonal prism case and similar cases. The rule for moving “tips” is intended to fa- 
cilitate decomposition of the nonlinear optimization problem into more tractable pieces 
via Theorem 5.5 below, and the use of “truncation.” 


5. Kepler Conjecture 
The main result to be established by the Hales program is the following. 


Theorem 5.1 (Main Theorem). For the Hales—Ferguson partition rule and scoring 
rule (Pyp, OnF), and any v € Qin a saturated sphere-packing, the vertex D-star Dyr(v) 
at V satisfies 


Score(Dyp(v)) < 8pt, (5.1) 
where pt := 11m/3 — 12 arecos(1/V3) ~ 0.0553736. 


The Kepler conjecture follows by Theorem 2.1. 

To prove inequality (5.1), by translation-invariance we can reduce to the case v = 0 
and search the set of all possible vertex stars, which by Section 4 are determined by 
those points w € Q with ||w|| < 12/2. The space of possible sphere centers {w € 
Q: ||wl| < 12/2} is compact. It can be decomposed into a large number of pieces, 
on each of which the score function is continuous. To obtain compact pieces, we must 
compactify the configuration space by assigning more than one possible local D-star 
D(v) to certain arrangements of sphere centers. The compactification assigns at most 
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finitely many possibilities to each arrangement, with an absolute upper bound on the 
number of possibilities. 

In what follows we generally assume that 0 € Q and we study the vertex star at v = 0. 
However, we state definitions and lemmas to be valid for general Q. 

The definition of the score Score(D(v)) involves a sum over the V-sets and D-sets. 
The usefulness of the compression measure I"(S) is justified by the following lemma. 


Lemma 5.1. 


(i) Every QR-tetrahedron T satisfies 
I(T) < pt, (5.2) 


with equality occurring only when T is a regular tetrahedron of edge length 2. 
Gi) A QL-tetrahedron T has 


I(T) <0, 33) 


with equality occurring for those T having five edges of length 2 and a spine of 
length 2/2. 


Result (ii) illustrates a somewhat counterintuitive behavior of the local density func- 
tion: when holding five edges of a tetrahedron fixed of length 2, and allowing the sixth 
edge to vary over ot < 1 < 2,/2, the local density measure is largest for a spine of 
maximal length. 

The vertices w € Q with ||w|| < i play a particularly important role, for they 


determine all QR-simplices of Q containing 0 as a vertex. 


Definition 5.1. The planar map (or graph) G(v) associated to a vertex v € Q consists 
of the radial projection onto the unit sphere 9 B(v; 1) = {x € R*: ||x—v|| = 1} centered 


at v of all vertices w € Q with ||w — v|| < a plus all those edges [w, w’] between two 


such vertices which have length ||w — w’|| < a 

Here we regard the planar map G(v) as being given with its embedding as a set of arcs 
on the sphere. The following lemma asserts that no new vertices are introduced other 
than those coming from points of Q with ||w|| < 7- 
Lemma 5.2. The radial projection of two edges [w,, W2], [w,, W5] as above onto the 
unit sphere 0 B(v; 1) give two arcs in G(v) which either are disjoint or which intersect 


at an endpoint of both arcs. 


We study local configurations classified by the planar map G(v). The planar map G(0), 
which is determined by the vertices ||w|| < a does not in general uniquely determine 
the vertex D-star Dyp(0), but does determine all points x in it with ||x|| < one 
Definition 5.2. The part of Dyr(v) that lies in the pointed cone with base point v 
determined by a face of the map G(v) is called the cluster over that face. Note that the 
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face need not be convex, or even simply connected—it could be topologically an annulus, 
for example. 


The following lemma shows that the vertex D-star Dyp(0) can be cut up into clusters 
in a way compatible with the scoring function. 


Lemma 5.3. Each QR-tetrahedron or QL-tetrahedron in the D-star Dyg(0) is con- 
tained in a single cluster. Furthermore, all such tetrahedra having a common spine are 
contained in a single cluster. 


In effect the partition of the vertex D-star into clusters partitions the V-cell into 
smaller pieces, while leaving the D-sets unaffected. The scoring function is additive 
over any partition of a V-cell into smaller pieces, according to (4.1) and (4.3). The score 
Oyr(F) of the cluster determined by a face F of G(v) is the sum of the scores of the 
QR-tetrahedra and QL-tetrahedra in the cluster, plus the Voronoi score 41°(R) of the 
remaining part R of the cluster. We then have 


Score(Dyp(v)) = > our(F). (5.4) 
FeG(v) 


We now consider clusters associated to the simplest faces F' in the graph G(v). Each 
triangular face corresponds to a QR-tetrahedron in Dyr(v), and, conversely, each QR- 
tetrahedron in Dyr(v) produces a triangular face. A quad cluster is a cluster over a 
quadrilateral face. A Q-octahedron with spine ending at 0 results in a quadrilateral 
face, but there are many other kinds of quad clusters. In the case of faces F with > 5 
edges, the cluster may consist of a V-cell plus some QL-tetrahedra, in many possible 
ways. All the possible decompositions into such pieces have to be considered as separate 
configurations. 


Lemma 5.4. 


(i) A cluster over a triangular face F consists of a single QR-tetrahedron, and 
conversely. The score of such a cluster is at most \pt, and equality holds if and 
only if it is a regular tetrahedron of edge length 2. 

(ii) The sum of the score functions over any quad cluster is at most zero. Equality 
can occur only if the four sphere centers v; corresponding to the vertices of the 
quad cluster each lie at distance 2 from v and also from each other, if they share 
an edge of the quad cluster. 

(iii) The score of a cluster over any face with five or more sides is strictly negative. 


The extremal graphs where equality is known to occur in (5.1) have eight triangular 
faces and six quadrilateral faces. The upper bound of 8 pt for these cases is implied by 
this lemma. (It appeared first in [SP-II, Theorem 4.1].) 

The following result rules out graphs G(v) with faces of high degree. 

Theorem 5.2. All decomposition stars Dyp(0) with planar maps G(0) satisfy 


Score(Dyp(0)) < 8pt (5.5) 
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unless the planar map G(O) consists entirely of (not necessarily convex) faces of the 
following kinds: polygons having at most eight sides, in which pentagons and hexagons 
may contain an isolated interior vertex or a single edge from an interior vertex to an 
outside vertex, and a pentagon may exclude from its interior a triangle with two interior 
vertices. 


There remain a finite set of possible map structures that satisfy the conditions of 
251 


Theorem 5.2. Here we use the fact that there can be at most 50 vertices v with ||v|| < 7. 
The list is further pruned by various methods, and reduced to about 5000 cases. Since the 
(putative) extremal cases are already covered by Lemma 5.4, in the remaining cases one 
wishes to prove a strict inequality in (5.1), and such bounds can be obtained in principle 
by computer. 

Most of the remaining cases are eliminated by linear programming bounds. The linear 
programs obtain upper bounds for the score function Score(Dyr(0)) for a planar map G 
of a particular configuration type, using the score function as the objective function, in 


the form 


Maximize Score(Dyp(0)) := ~~ o(F), (5.6) 


faces 
F 
where the variable o(F’) is the sum of weights associated to the cluster over the face 
F.. The use of linear programming relaxations of the nonlinear program seems to be a 
necessity in bounding the score function. For example, the compression function '(R) 
for different regions R is badly behaved: it is neither convex nor concave in general. The 
linear constraints include hyperplanes bounding the convex hull of the score function 
over the variable space. 
One can decouple the contributions of the separate faces F of G(v) using the following 
result. 


Lemma 5.5 (Decoupling Lemma). Let v € Q be a vertex of a saturated packing and 
let F be a face of the associated planar map G(v), and let Cr denote the (closed) pointed 
cone over F with vertex v, and let Crea denote the closure of the cone over F obtained 
by removing from Cr all cones over D-sets with a corner at v. Then the portion of the 
V-cell V (v) that lies in Cr is completely determined by the vertices of Q2 that fall in the 
smallest closed convex cone Cr containing Cr. In particular, 


Ve = Viv) NCr = V(QNCz, V) NC rea. (5.7) 


To obtain such a decoupling lemma requires the exchange of “tips” between Voronoi 
domains, as described in Section 4. 

The decoupling lemma permits the score function o (V (v), v) to be decomposed into 
polyhedral pieces that depend on only a few of the nearby vertices. This decomposes the 
problem into a sum of smaller problems, to bound the scores of the pieces 0 (V (v)NC Fr, Vv) 
in terms of these vertices. It will often be applied when the face F is convex, in which 
case Cp = Cp. 
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A futher very important relaxation of the linear programs involves “truncation.” The 
truncated V-cell is 


Verune(¥) = Vv) N BCv: 3). (5.8) 


The following lemma says that we may choose whether to apply truncation or not 
separately to the parts of the V-cell which lies over each face of G. 


Lemma 5.6. Let F be a face of G(v) and let Cr be the cone over that face. The region 
Verunc(V) NC ¢ is entirely determined by the vertices of G(v) inCr. If Vr denotes this set of 
vertices, together with v, then this region is the closure of (Vyor(V, V) AC) — {D-sets}. 
The compression function satisfies the bound 


P(Virune(V) 1 Cr) = PV (v) NC). (5.9) 


Inequality (5.9) implies that replacing a Voronoi-type region by a truncated region 
can only increase the score, hence one can relax the linear program by using the score of 
truncated regions. If one is lucky the linear programming bounds using truncated regions 
will still be strong enough to give the desired inequality. The use of truncation greatly 
reduces the number of configurations that must be examined. Truncation bounds were 
also used in proving Theorem 5.2 above. 

We add the following remarks about the construction of the linear programming 
problems: 


(1) For each face F of a given graph type G, Hales and Ferguson construct a large 
number of linear programming constraints in terms of the edge lengths, dihedral 
angles, and solid angles of the polyhedral pieces making up the cluster of Dyr(v) 
over face F of the graph G. The edge lengths, dihedral angles, and solid angles 
are variables in the linear program. Some of the constraints embody geometric 
restrictions that a polyhedron of the given type must satisfy. Others of them 
are inequalities relating the weight function of the polyhedron, which is also 
a variable in the linear program, to the geometric quantities. The inequalities 
bound the score function on the cluster (either as a V-cell or as D-sets) in terms 
of these variables. There are also some global constraints in the linear program, 
for example, that the solid angles of the faces around v add up to 47. 

(2) The weight function for D-sets does not permit subdivision of the simplex, but 
the weight function on the V-cell is additive under subdivision, so one can cut up 
such regions into smaller pieces if necessary, to get improved linear programming 
bounds, by including more stringent constraints. 

(3) In the linear programming relaxation, a feasible solution to the constraints need 
not correspond to any geometrically constructible vertex D-star. All that is re- 
quired is that every vertex D-star of the particular configuration type corresponds 
to some feasible point of the linear program. 


In this fashion one obtains a long list of linear programs, one for each configuration 
type, and to rule out a map type G one needs an upper bound for the linear program’s 
objective function strictly below 8pr. To obtain such an upper bound rigorously, it suffices 
to find a feasible solution to the dual linear program, and to obtain a good upper bound 
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one wants the dual feasible solution close to a dual optimal solution. The value of the dual 
linear program’s objective function is then a certified upper bound to the primal linear 
program. To obtain such a certification, it is useful to formulate the linear programs so 
that the dual linear program has only inequality constraints, with no equality constraints, 
so that the feasible region for it is full-dimensional. This way, one can guarantee that 
the dual feasible solution is strictly inside the dual feasible region which facilitates 
checking feasibility. This is necessary because the linear program put on the computer 
is only an approximation to the true linear program. For example, certain constraints 
of the true linear program involve transcendental numbers like zr, and one considers an 
approximation. The effect of these errors is to perturb the objective function of the dual 
linear program. Thus a rigorous bound on the effect of these perturbations on the upper 
bound can be obtained in terms of the dual feasible solution. In this way one can (in 
principle) get a certified upper bound!° on the score for a map type G, using a computer. 

The linear programming bounds in the Ferguson—Hales approach above suffice to 
eliminate all map types G not ruled out by Theorem 5.2 except for about 100 “bad” 
cases. These are then handled by ad hoc methods. I have not studied the details about 
how these remaining “bad” cases are handled. Presumably they are split into smaller 
pieces, extra inequalities are generated somehow, and perhaps specific information on 
the location of vertices of more than a is incorporated into the linear programs. 


6. Concluding Remarks 


The Kepler conjecture appears to be an extraordinarily difficult nonlinear optimization 
problem. The “configuration space” to be optimized over has an extremely complicated 
structure, of high dimensionality, and the function being optimized is highly nonlinear 
and nonconvex, and lacks good monotonicity properties. The crux of the Hales approach 
is to select a formulation of an optimization problem that can be carried out (mostly by 
computer) in a reasonable length of time. This led to the Hales—Ferguson choice of a 
very complicated partition and score function, giving an inelegant local inequality, which 
however has good decomposition properties in terms of the nonlinear program. Much of 
the work in the proof lies in the reductions to reasonable sized cases and the use of linear 
programming relaxations. The elimination of the most complicated cases in Theorem 
5.2 was a major accomplishment of this approach. The use of Delaunay simplices to 
cover most of the volume where density is high seems important to the proof and to the 
choice of score functions, since simple analytic formulas are available for tetrahedra. 
The Hales—Ferguson proof, assumed correct, is a tour de force of nonlinear optimization. 

In contrast, the Hsiang approach formulates a relatively elegant local inequality, in- 
volving only Voronoi domains and a fairly simple weight function: only nearest neighbor 
regions are counted. It is conceivable that a rigorous proof of the Hsiang inequality can 
be established, but it very likely will require an enormous computer-aided proof of a 
sort very similar to the Hales approach. Voronoi domains do not seem well suited to 


10 The Hales proof in the preprints used a linear programming package CPLEX that does not supply such 
certificates. Therefore the linear programming part of the Ferguson—Hales proof needs to be re-done to obtain 
guaranteed certificates. 
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computer proof: they may have 40 or more faces each, and the Hsiang approach requires 
considering up to 20 of them at a time. A computer-aided proof would likely have to 
dissect the Voronoi domains into pieces, further increasing the size of the problem. 
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Appendix A. Hales Score Function Formulas 


These definitions are taken from Section 8 of [SP-I] and pp. 8-11 of [FH]. A tetrahedron 
T (hy, ly, 1, ..., 16) is uniquely determined by its six edge lengths /;. Let the vertices of 
T be Vo, Vi, V2, V3 and number the edges as 


i; = \lvo-—vil| for 1 <i <3, ly = |lv2 — vall, ls = |lvi — val, 


and Ig = |/v; — voll. (A.1) 


We take vp = 0 for convenience. 

Suppose that the circumcenter w, = w,(T) of T is contained in the pointed cone over 
vertex Vo, determined by 7. Let i, denote the part of the Voronoi cell of Vo with respect 
to the set Q = {Vo, Vi, V2, V3} of vertices of T that lies in 7. Suppose in addition that the 
three faces of T containing Vo are each non-obtuse triangles. Then the set Ty subdivides 
into six pieces, called Rogers simplices by Hales [SP-I, p. 31]. A Rogers simplex in T is 
the convex hull of vo, the midpoint of an edge emanating from vo, the circumcenter of 
one face of T containing that edge, and the circumcenter w, = w,(T). If a denotes the 
half-length of an edge, b the circumradius of a face, and c = |ly-|| is the circumradius 
of 7, then the associated Rogers simplex has shape 


RGR TE bee Hb) aay? Cay), (A.2) 
with the positive square root taken. The intersection of a unit sphere centered at 0 with 
R(a, b, c) has volume i Sol(vo; R(a, b, c)), where Sol(vo; R(a, b, c)) denotes the solid 
angle of R(a, b, c) at Vp, normalized so that a total solid angle is 427. The values 

x =? (A.3) 


are the squares of the edge lengths. 


Lemma A.1. The solid angle Sol(vo, T) of a tetrahedron T (I, ly, Is, la, Is, 16) is given 
by 


2A 
Sol(vo, T) := 2 arccot Ain (A.4) 
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in which the positive square root of A is taken, the value of arccot lies in [0, zr], 
Ahi, by Is, lay ds, Is) = hb+$h(G+G—-1)+ 3h t+G -1)+ 5b +G-h) (AS) 
and 
A(h, la, bs, la, Is, Ie) 
=PR-B+Bb+8-848-2)+88G@-84+8+0-2+2 
+BRG+B-B8+0+82-% 
BEG -— GBR — AER — ERR. (A.6) 


Definition A.1. 


(i) For a tetrahedron T(/,, ,...,16) with vertex vo, if the circumcenter w, of T 
falls inside the cone determined by T at vo, then we set 


6 
vor (T, Vo) := 45 “{vol(Ri (a, b, c))(—Soct) + : Sol(Ri, Vo)} (A.7) 


i=1 
with 
a(b? = a’)!/2(¢2 = b?)!/2 


vol(R(a, b,c)) := 6 : 


for l1<a<b<ce. (A.8) 


This formula satisfies vor(T, Vo) = AT (Tp). 

(ii) The six tetrahedra R;(a, b, c) are still defined even when the circumcenter w, 
falls outside the cone of T at vertex vg, and we still take the formula (A.7) to 
define vor(T, Vo), except that both vol(R; (a, b, c)) and Sol(R;, Vo) are counted 
with a negative sign: each tetrahedron R;(a, b,c) falls outside T, and has no 
interior in common with it. 


Hales calls the definition (ii) the “analytic continuation” of case (i). It has a geometric 
interpretation. 

The truncated Voronoi function vor (T, Vo; t) of atetrahedron T at vertex Vo is intended 
to measure the compression ry 1 B(vo; t)). Here we have truncated the region Ty by 
removing from it all points at distance greater than t from vp. We set 


voro(T, Vo) := vor(T, Vo, oh), (A.9) 
The definition 
vor(T, Vo; t) = (To M B(vo; t)) (A.10) 


is valid only when the circumcenter w, of T lies in the cone generated from T at vertex 
vo. In the remaining case one must construct an analytic representation analogous to 
(A.7) for vor (T, Vo; t). This is done on pp. 9-10 of [FH]. 
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Appendix B. References to the Hales Program Results 


This paper was written to state the Hales—Ferguson local inequality in as simple a way 
as I could find, and does not match the order in which things are done in the preprints 
of Hales and Ferguson. Also, the lemmas and theorems stated here are not all stated 
in the Hales and Ferguson preprints; some of them are based on talks that Hales gave 
at IAS in January 1999. The pointers below indicate where to look in the preprints for 
the results I formulate as lemmas and theorems. Warning: The Hales—Ferguson partition 
and scoring function given in [FH], which are the ones actually used for the proof of the 
Kepler conjecture, differ from those used earlier by Hales in [SP-I] and [SP-II]. 


(0) The idea of considering local inequalities that weight total area and covered area 
by spheres in a ratio B/A that is not equal to the optimal density occurs in Hales’ 
original approach based on Delaunay triangulations, see [Hal] and [Ha2]. It also 
appears in [SP-I, Lemma 2.1] and in [FH, Proposition 3.14]. I have inserted the 
parameters A and B in order to include the density inequality of Hsiang [Hs1] in 
the same framework. 

(1) Definitions 4.1 and 4.2 appear on p. 2 of [FH]. 

(2) Lemma 4.1 is Lemma 1.2 of [FH], proved in Lemma 3.5 of [SP-I]. (The fact 
that no vertex of Q occurs inside a face of a QL-tetrahedron or a OR-tetrahedron 
requires additional argument.) 

(3) Lemma 4.2 follows from Lemma 1.3 of [FH]. 

(4) Lemma 4.3 is proved at the bottom of p. 3 of [FH]. 

(5) Lemma 4.4 is covered in the discussion on pp. 5-6 of [FH]. 

(6) Lemma 4.5(i)—(iii) is covered in the discussion on p. 5 of [FH], including Lem- 
ma 1.8 of that paper. 

(7) The notion of “tip” is discussed at length in Section 2 of [SP-I]. In part II “tips” 
are not actually reassigned, although this is mentioned. Instead their existence 
affects (i.e., is encoded in) the scoring rule used for the associated Delaunay 
simplex which the “tip” is associated to. The rules for moving “tips” around to 
make V-cells in the Hales—Ferguson approach are discussed on p. 8 of [FH]. 
Warning: The way that “tips” are handled in part I and in [FH] may not be the 
same: [FH] takes priority. 

(8) Lemma 4.6(i) is Lemma 2.2 of [SP-II] and Lemma 4.17 of [FH]. Facts related to 
(ii) are discussed in Section 8.6.7 of [SP-I]. (For the second part I do not have a 
reference.) (iii) Hales mentioned this in IAS lectures, and sent me a proof sketch, 
which I expanded into the following: Let S be a simplex in the D-system that 
overlaps a “tip” protuding from v. Say that the “tip” overlaps by pointing to $ 
along a face F of S. Thus F is a negatively oriented face of S’ = (F, v), which 
means that the simplex S’ is a QR-tetrahedron or else a QL-tetrahedron with spine 
on F’. Suppose first that S’ is a QL-tetrahedron. It now follows that S must be 
a QL-tetrahedron with its spine on F by Lemma 2.2 of [FH]. So S’ and S are 
adjacent QL-tetrahedra with spines on their common face F’. Now S is in the 
D-system since S’ is in the D-system. Thus the distance of v to the vertices in F 
is at most 2", since v is not on the spine. We now suppose that the “tip” is not 


100’ 
entirely contained in S, and derive a contradiction. If it is not contained in S, then 
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it crosses out through a face F” of S. By the same argument, the distance from v to 
251 


the vertices of F’ is at most a Thus v has distances at most 7, from all vertices 
of S, which is impossible by Lemmas 1.2 and 1.3 of [FH]. Suppose secondly 
that S’ is a QR-tetrahedron. Then one shows that S is also a QR-tetrahedron, 
hence is in the D-system. The rest of the argument goes as before, to the same 
contradiction. 

(9) Theorem 5.1. The main theorem is first stated as Conjecture 3.15 on p. 13 of [FH]. 
It is the theorem asserted to be proved in [KC]. 

(10) Lemma 5.1(j) appears as Lemma 3.13 in [FH]. Lemma 5.1 (ii) is a special case 
of Lemma 3.13 of [FH] for a quad cluster, which can consist of four congruent 
QL-tetrahedra. 

(11) The standard regions corresponding to the graph G(v) are defined on p. 4 of [FH]. 
(“Planar map that breaks unit sphere into regions.”’) 

(12) Lemma 5.2 follows from Lemma 1.6 of [FH], which implies that crossing lines 
come from QL-tetrahedra only. 

(13) Lemma 5.3 is an immediate consequence of my Lemma 5.2 and Lemma 1.3 of 
[FH]. 

(14) Lemma 5.4 appears as Lemma 3.13 in [FH]. 

(15) Theorem 5.2 follows from the corollary to Theorem 4.4 of [SP-IV]. See also 
Proposition 7.1 of [SP-III]. 

(16) Lemmas 5.5 and 5.6. These results are briefly stated at the bottom of p. 8 of [FH]. 
There are also some relevant details in Section 2.2 of [SP-II]. (I do not know an 
exact reference for a detailed proof.) 


Hales—Ferguson Terminology in 
terminology this paper 
(1) Decomposition star Vertex D-star 
(2) Quasiregular tetrahedron QR-tetrahedron 
(3) Quarter QL-tetrahedron 
(4) Diagonal (of quarter) Spine (of QL-tetrahedron) 
(5) Q-system D-system 
(6) Score a(R, v) Weight function o(R, v) 
(7) Standard cluster Cluster 
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Proof of the Kepler Conjecture 
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Introduction to Part II 


Part II contains the published issue of Discrete & Computational Geometry contain- 
ing the Hales-Ferguson proof of the Kepler Conjecture (Discrete Comput. Geom., 
36 (2006), 1-269). This consists of six papers together with a Guest Editors’ Fore- 
word. 


The Guest Editor’s Foreword include remarks on the proof and on the seven-year 
reviewing process. 


These six papers published in this issue represent the top of the proof tree. At 
the bottom layers of the proof are a large number of inequalities checked by com- 
puter. Collections of these inequalities appear in a database maintained by Annals of 
Mathematics, and also in another database maintained by T. C. Hales. 
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Guest Editors’ Foreword 


This issue of Discrete & Computational Geometry contains the detailed proof by T. Hales 
and S. P. Ferguson of the Kepler conjecture that the densest packing of three-dimensional 
Euclidean space by equal spheres is attained by “cannonball” packing. This is a landmark 
result. This conjecture, formulated by Kepler in 1611, was stated in Hilbert’s formulation 
of his 18th problem [8]. The proof consists of mathematical arguments and a massive 
computer veri cation of many inequalities. 

In 1953 Laszl6 Fejes T6th [1] indicated the possibility of proving the Kepler conjecture 
by relating it to a nonlinear optimization problem over a compact set, and in 1964 he 
proposed a specic such inequality [2, pp. 295-299]. Such optimization problems can 
generically be termed “local density inequalities” as they involve some weighted measure 
of density associated to a neighborhood of each sphere center in a packing. Establishing 
a local density inequality yields a result stronger than the Kepler conjecture, since it 
requires the existence of a packing maximizing the local inequality simultaneously at 
every sphere center. Thus different local density inequalities are different mathematical 
results, although each may imply Kepler’s conjecture as a corollary. Assuming that 
the optimization problem has a nite set of isolated global minima, and that the local 
minimality can be veri ed analytically in an open neighborhood of each of these minima, 
in principle the inequality can be proved by computer elsewhere using interval arithmetic. 
However, the size of the resulting optimization problems seemed far beyond the range 
of what could be solved by computer at that time, or even now, without new ideas to 
reduce the complexity of the problem. 

An approach to prove the Kepler conjecture in this fashion was developed by Hales 
starting about 1992 (see [3] and [4]). A signi cant problem lies in formulating a local 
density inequality of this kind that is expected to be valid, and which represents an opti- 
mization problem small enough to be solvable in a reasonable time. A de nite program 
to prove it was then initiated by Hales after 1995 in [5] and [6]. Further modi cations 
of the local density inequality were required to obtain a tractable problem, and in this 
formulation process, important contributions were made by S. P. Ferguson, a student of 
Hales whose doctoral thesis work forms the paper Sphere Packing V in this issue. Signif- 
icant innovations of Hales were needed to reduce the optimization problem to a tractable 
size. A rst version of a complete proof was presented in six preprints, one joint with his 
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Ph.D. student Ferguson, and one by Ferguson alone, posted on the mathematics arXiv in 
1998. 

A week-long workshop on the proof was given at the Institute for Advanced Study 
in January 1999. At that time the second editor wrote a summary of the top level of the 
proof tree, which was later included in [9]. The six papers were submitted to Annals 
of Mathematics and a team of reviewers was assembled by the rst editor. The main 
portion of the reviewing took place in a seminar run at Eotvos University over a 3 year 
period. Some computer experiments were done in a detailed check. The nature of this 
proof, consisting in part of a large number of inequalities having little internal structure, 
and a complicated proof tree, makes it hard for humans to check every step reliably. 
Detailed checking of specic assertions found them to be essentially correct in every 
case tested. The reviewing process produced in the reviewers a strong degree of conviction 
of the essential correctness of this proof approach, and that the reduction method led to 
nonlinear programming problems of tractable size. 

The second editor became involved in the reviewing process in January 2003. This 
resulted in a substantial revision and reorganization of the papers making clearer the top 
portion of the proof tree, and leading to a simpli cation of some de nitions. The papers 
now include extra remarks motivating the choice of the given local density inequal- 
ity, which is very complicated, and provide some reasons indicating why one should 
expect this particular local density inequality to hold. The revised version of the Fer- 
guson paper now includes an explicit error bound away from the optimal density. An 
abridged version of the proof (excluding the paper of Ferguson) appears in Annals of 
Mathematics by Hales [7], with computer les and programs available on the Annals of 
Mathematics website. The detailed version of the proof, representing revised versions 
of the six 1998 preprints, appears in this special issue of Discrete & Computational 
Geometry. 

Summaries of some aspects of this proof and other local density approaches to the 
Kepler conjecture are discussed in the Seminar Bourbaki report of Oesterlé [10] and 
in [9]. 

The reviewing of these papers was a particularly enormous and daunting task. The 
rst editor wishes to thank the following reviewers who substantially contributed to 
the process: Andras Bezdek, Michael Bleicher, Karoly Boroczky, Karoly Boroczky, Jr., 
Aladar Heppes, Wlodek Kuperberg, Endre Makai, Attila Por, Gunter Rote, Istvan Talata, 
Bela Uhrin, and Zoltan Ujvary-Menyhard. 
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Historical Overview of the Kepler Conjecture 


Thomas C. Hales 
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hales @pitt.edu 


Abstract. This paper is the first in a series of six papers devoted to the proof of the Kepler 
conjecture, which asserts that no packing of congruent balls in three dimensions has density 
greater than the face-centered cubic packing. After some preliminary comments about the 
face-centered cubic and hexagonal close packings, the history of the Kepler problem is 
described, including a discussion of various published bounds on the density of sphere 
packings. There is also a general historical discussion of various proof strategies that have 
been tried with this problem. 


1. Introduction 
1.1. The Face-Centered Cubic Packing 


A packing of spheres is an arrangement of nonoverlapping spheres of radius | in Eu- 
clidean space. Each sphere is determined by its center, so equivalently it is a collection 
of points in Euclidean space separated by distances of at least 2. The density of a pack- 
ing is defined as the lim sup of the densities of the partial packings formed by spheres 
inside a ball with fixed center of radius R. (By taking the lim sup, rather than lim inf, 
as the density, we prove the Kepler conjecture in the strongest possible sense.) Defined 
as a limit, the density is insensitive to changes in the packing in any bounded region. 
For example, a finite number of spheres can be removed from the face-centered cubic 
packing without affecting its density. 

Consequently, it is not possible to hope for any strong uniqueness results for packings 
of optimal density. The uniqueness established by this work is as strong as can be hoped 
for. It shows that certain local structures (decomposition stars) attached to the face- 
centered cubic (fcc) and hexagonal-close packings (hcp) are the only structures that 
maximize a local density function. 
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Fig. 1.1. The fcc packing. 


Although we do not pursue this point, Conway and Sloane develop a theory of tight 
packings that is more restrictive than having the greatest possible density [CS1]. An open 
problem is to prove that their list of tight packings in three dimensions is complete. 

The fcc packing appears in Fig. 1.1. 

The following facts about packings are well known. However, there is a popular and 
persistent misconception in the popular press that the fcc packing is the only packing 
with density 2//18. The comments that follow correct this misconception. 

In the fcc packing, each ball is tangent to twelve others. For each ball in the packing, 
this arrangement of twelve tangent balls is the same. We call it the fcc pattern. In the 
hcp, each ball is tangent to twelve others. For each ball in the packing, the arrangement 
of twelve tangent balls is again the same. We call it the hcp pattern. The fcc pattern is 
different from the hcp pattern. In the fcc pattern there are four different planes through 
the center of the central ball that contain the centers of six other balls at the vertices of a 
regular hexagon. In the hcp pattern there is only one such plane. We call the arrangement 
of balls tangent to a given ball the local tangent arrangement of the ball. 

There are uncountably many packings of density 2 /./18 that have the property that 
every ball is tangent to twelve others and such that the tangent arrangement around each 
ball is either the fcc pattern or the hcp pattern. 

By hexagonal layer, we mean a translate of the two-dimensional lattice of points M 
in the A, arrangement. That is, M is a translate of the planar lattice generated by two 
vectors of length 2 and angle 27/3. The fcc packing is an example of a packing built 
from hexagonal layers. 

If M is a hexagonal layer, a second hexagonal layer M’ can be placed parallel to the 
first so that each lattice point of M’' has distance 2 from three different vertices of M. 
When the second layer is placed in this manner, it is as close to the first layer as possible. 
Fix M and a unit normal to the plane of M. The normal allows us to speak of the second 
layer M’ as being “above” or “below” layer M. There are two different positions in 
which M’ can be placed closely above M and two different positions in which M’ can 
be placed closely below M. As we build a packing, layer by layer (M, M’, M”, and so 
forth), there are two choices at each stage of the close placement of the layer above the 
previous layer. Running through different sequences of choices gives uncountably many 
packings. In each of these packings the tangent arrangement around each ball is that of 
the twelve spheres in the fcc or the twelve spheres in the hcp. 
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Let A be a packing built as a sequence of close-packed hexagonal layers in this 
fashion. If P is any plane parallel to the hexagonal layers, then there are at most three 
different orthogonal projections of layers M to P. Call these projections A, B and C. 
Each hexagonal layer has a different projection than the layers immediately above and 
below it. In the fcc packing, the successive layers are A, B,C, A, B, C,.... In the hcp 
packing, the successive layers are A, B, A, B,.... If we represent A, B, and C as the 
vertices of a triangle, then the succession of hexagonal layers can be described by a walk 
along the vertices of the triangle. Different walks through the triangle describe different 
packings. 

In fact, the different walks through a triangle give all packings of infinitely many 
equal balls in which the tangent arrangement around every ball is either the fcc pattern 
of twelve balls or the hcp pattern of twelve balls. 


We justify the fact that different walks through a triangle give all such packings. 
Assume first that a packing A contains a ball (centered at vg) in the hcp pattern. 
The hcp pattern contains a uniquely determined plane of symmetry. This plane 
contains vo and the centers of six others arranged in a regular hexagonal. If v 
is the center of one of the six others in the plane of symmetry, its local tangent 
arrangement of twelve balls must include vp and an additional four of the twelve 
balls around vo. These five centers around v are not a subset of the fcc pattern. 
They can be uniquely extended to twelve centers arranged in the hcp pattern. This 
hcp pattern has the same plane of symmetry as the hcp pattern around vo. In this 
way, as soon as there is a single center with the hcp pattern, the pattern propagates 
along the plane of symmetry to create a hexagonal layer M. 

Once a packing A contains a single hexagonal layer, the condition that each 
ball be tangent to twelve others forces a hexagonal layer M’ above M and another 
hexagonal layer below M. Thus, a single hexagonal layer forces a sequence of 
close-packed hexagonal layers in both directions. 

We have justified the claim under the hypothesis that A contains at least one 
ball with the hcp pattern. 

Assume that A does not contain any balls whose local tangent arrangement is 
the hcp pattern. Then every local tangent arrangement is the fcc pattern, and A 
itself is then the face-centered cubic packing. This completes the proof. 


1.2. Early History, Hariot, and Kepler 


The study of the mathematical properties of the fcc packing can be traced back to a 
Sanskrit work composed around 499 CE [P]. 

The modern mathematical study of spheres and their close packings can be traced to 
T. Hariot. Hariot’s work—unpublished, unedited, and largely undated—shows a preoc- 
cupation with sphere packings. He seems to have first taken an interest in packings at 
the prompting of Sir Walter Raleigh. At the time, Hariot was Raleigh’s mathematical 
assistant, and Raleigh gave him the problem of determining formulas for the number of 
cannonballs in regularly stacked piles. In 1591 he prepared a chart of triangular numbers 
for Raleigh. Shirley, Hariot’s biographer, writes that the charts prepared for Raleigh led 
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Fig. 1.2. Hariot’s view of Pascal’s triangle. 


to the study of the sums of squares, “a study which led inevitably to the corpuscular or 
atomic theory of matter orginally deriving from Lucretius and Epicurus” [Sh, p. 242]. 

Hariot connected sphere packings to Pascal’s triangle long before Pascal introduced 
the triangle. See Fig. 1.2. 

Hariot was the first to distinguish between the fcc and hcp [Ma, p. 52]. 

Kepler became involved in sphere packings through his correspondence with Hariot in 
the early years of the 17th century. Despite Kepler’s initial reluctance to adopt an atomic 
theory, he was eventually swayed, and in 1611 he published an essay that explores the 
consequences of a theory of matter composed of small spherical particles. Kepler’s essay 
was the “first recorded step towards a mathematical theory of the genesis of inorganic 
or organic form” [W, p. v]. 

Kepler’s essay describes the fcc packing and asserts that “the packing will be the 
tightest possible, so that in no other arrangement could more pellets be stuffed into the 
same container.” This assertion has come to be known as the Kepler conjecture. The 
purpose of this collection of papers is to give a proof of this conjecture. 


1.3. History 


The next episode in the history of this problem is a debate between Isaac Newton and 
David Gregory. Newton and Gregory discussed the question of how many spheres of 
equal radius can be arranged to touch a given sphere. This is the three-dimensional 
analogue of the simple fact that in two dimensions six pennies, but no more, can be 
arranged to touch a central penny. This is the kissing-number problem in n-dimensions. 
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In three dimensions Newton said that the maximum was twelve spheres, but Gregory 
claimed that thirteen might be possible. 

Newton was correct. In the 19th century, the first papers claiming a proof of the 
kissing-number problem appeared in [Ben], [Gu], and [Ho]. Although some writers 
cite these papers as a proof, they are hardly rigorous by today’s standards. Another 
incorrect proof appears in [Bo]. The first proper proof was obtained by Schiitte and van 
der Waerden in 1953 [SW]. An elementary proof appears in Leech [Le]. The influence 
of van der Waerden, Schiitte, and Leech upon the papers in this collection is readily 
apparent. Although the connection between the Newton—Gregory problem and Kepler’s 
problem is not obvious, L. Fejes Téth in 1953, in the first work describing a strategy 
to prove the Kepler conjecture, made a quantitative version of the Gregory—Newton 
problem the first step [Fej6]. 

The two-dimensional analogue of the Kepler conjecture is to show that the honeycomb 
packing in two dimensions gives the highest density. This result was established in 1892 
by Thue, with a second proof appearing in 1910 [T1], [T2]. Szpiro’s book on the Kepler 
conjecture calls Thue’s proofs into question [Sz]. C. Siegel said that Thue’s original 
proof is “reasonable, but full of holes” [Sz]. A number of other proofs have appeared 
since then. Three are particularly notable. Rogers’s proof generalizes to give a bound 
on the density of packings in any dimension [Rol]. A proof by L. Fejes Téth extends 
to give bounds on the density of packings of convex disks [Fej5]. A third proof, also by 
L. Fejes Toth, extends to non-Euclidean geometries [Fej6]. Another early proof appears 
in [SM]. 

In 1900, Hilbert made the Kepler conjecture part of his 18th problem [Hi]. Milnor, in 
his review of Hilbert’s 18th problem, breaks the problem into three parts [Mi]. The third 
part asks for the densest arrangements of congruent solids such as “spheres with given 
radii.” Milnor writes that it is a “scandalous situation” that the sphere packing “problem 
in 3 dimensions remains unsolved.” 


1.4. The Literature 


Past progress toward the Kepler conjecture can be arranged into four categories: 


e bounds on the density, 

e descriptions of classes of packings for which the bound of 2//18 is known, 

e convex bodies other than spheres for which the packing density can be determined 
precisely, 

e strategies of proof. 


1.4.1. Bounds. Various upper bounds have been established on the density of packings: 


0.884 (Blichfeldt) [B11], 
0.835 (Blichfeldt) [B12], 
0.828 (Rankin) [Ra], 
0.7797 (Rogers) [Rol], 
0.77844 (Lindsey) [Li], 
0.77836 (Muder)[Mul1], 
0.7731 (Muder) [Mu2]. 
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Rogers’s is a particularly natural bound. As the dates indicate, it remained the best 
available bound for many years. His monotonicity lemma and his decomposition of 
Voronoi cells into simplices have become important elements in the proof of the Kepler 
conjecture. We give a new proof of Rogers’s bound in “Sphere Packings, III” A func- 
tion t, used throughout this collection, measures the departure of various objects from 
Rogers’s bound. 

Muder’s bounds, although they appear to be rather small improvements of Rogers’s 
bound, are the first to make use of the full Voronoi cell in the determination of densities. 
As such, they mark a transition to a greater level of sophistication and difficulty. Muder’s 
influence on the work in this collection is also apparent. 

A sphere packing admits a Voronoi decomposition: around every sphere take the 
convex region consisting of points closer to that sphere center than to any other sphere 
center. L. Fejes Téth’s dodecahedral conjecture asserts that the Voronoi cell of smallest 
volume is a regular dodecahedron with inradius | [Fej4]. The dodecahedral conjecture 
implies a bound of 0.755 on sphere packings. L. Fejes Toth actually gave a complete proof 
except for one estimate. A footnote in his paper documents the gap, “In the proof, we have 
relied to some extent solely on intuitive observation [Anschauung].” As L. Fejes Toth 
pointed out, that estimate is extraordinarily difficult, and the dodecahedral conjecture 
has resisted all efforts until now [Mc]. 

The missing estimate in L. Fejes Téth’s paper is an explicit form of the Newton— 
Gregory problem. What is needed is an explicit bound on how close the thirteenth sphere 
can come to touching the central sphere, or, more generally, to minimize the sum of 
the distances of the thirteen spheres from the central sphere. No satisfactory bounds are 
known. Boerdijk has a conjecture for the arrangement that minimizes the average distance 
of the thirteen spheres from the central sphere. Van der Waerden has a conjecture for the 
closest arrangement of thirteen spheres in which all spheres have the same distance from 
the central sphere. Bezdek has shown that the dodecahedral conjecture would follow 
from weaker bounds than those originally proposed by L. Fejes Toth [Bez2]. 

A proof of the dodecahedral conjecture has traditionally been viewed as the first step 
toward a proof of the Kepler conjecture, and if little progress has been made until now 
toward a complete solution of the Kepler conjecture, the difficulty of the dodecahedral 
conjecture is certainly responsible to a large degree. 


1.4.2. Classes of Packings. If the infinite-dimensional space of all packings is too 
unwieldy, we can ask if it is possible to establish the bound 7/18 for packings with 
special structures. 

If we restrict the problem to packings whose sphere centers are the points of a lattice, 
the packings are described by a finite number of parameters, and the problem becomes 
much more accessible. Lagrange proved that the densest lattice packing in two dimen- 
sions is the familiar honeycomb arrangement [La]. Gauss proved that the densest lattice 
packing in three dimensions is the fcc [G]. In dimensions 4-8 the optimal lattices are 
described by their root systems, Az, A3, D4, Ds, E¢, E7, and Eg. Korkine and Zolotareff 
showed that D4 and Ds are the densest lattice packings in dimensions 4 and 5 [KZ1], 
[KZ2]. Blichfeldt determined the densest lattice packings in dimensions 6-8 [B13]. Cohn 
and Kumar solved the problem in dimension 24 [CK]. With the exception of dimension 
24, beyond dimension 8, there are no proofs of optimality, and yet there are many excel- 
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lent candidates for the densest lattice packings. For a proof of the existence of optimal 
lattices, see [O]. 

Although lattice packings are of particular interest because they relate to so many 
different branches of mathematics, Rogers has conjectured that in sufficiently high di- 
mensions, the densest packings are not lattice packings [Ro2]. In fact, the densest known 
packings in various dimensions are not lattice packings. The third edition of [CS2] gives 
several examples of nonlattice packings that are denser than any known lattice packings 
(dimensions 10, 11, 13, 18, 20, 22). The densest packings of typical convex sets in the 
plane, in the sense of Baire categories, are not lattice packings [Fej1]. 

Gauss’s theorem on lattice densities has been generalized by Bezdek, Kuperberg, and 
Makai, Jr. [BKM]. They showed that packings of parallel strings of spheres never have 
density greater than 7/4/18. 


1.4.3. Other Convex Bodies. If the optimal sphere packings are too difficult to deter- 
mine, we might ask whether the problem can be solved for other convex bodies. To avoid 
trivialities, we restrict our attention to convex bodies whose packing density is strictly 
less than 1. 

The first convex body in Euclidean 3-space that does not tile for which the packing 
density was explicitly determined is an infinite cylinder [BK]. Here Bezdek and Ku- 
perberg prove that the optimal density is obtained by arranging the cylinders in parallel 
columns in the honeycomb arrangement. 

In 1993 Pach exposed the humbling depth of our ignorance when he issued the 
challenge to determine the packing density for some bounded convex body that does 
not tile space [MP]. (Pach’s question is more revealing than anything I can write on 
the subject of discrete geometry.) This question was answered by Bezdek [Bez1], who 
determined the packing density of a rhombic dodecahedron that has one corner clipped 
so that it no longer tiles. The packing density equals the ratio of the volume of the clipped 
rhombic dodecahedron to the volume of the unclipped rhombic dodecahedron. 


1.4.4. Strategies of Proof. In 1953 L. Fejes Toth proposed a program to prove the 
Kepler conjecture [Fej6]. A single Voronoi cell cannot lead to a bound better than the 
dodecahedral conjecture. L. Fejes Toth considered weighted averages of the volumes 
of collections of Voronoi cells. These weighted averages involve up to thirteen Voronoi 
cells. He showed that if a particular weighted average of volumes is greater than the 
volume of the rhombic dodecahedron, then the Kepler conjecture follows. The Kepler 
conjecture is an optimization problem in an infinite number of variables. L. Fejes Toth’s 
weighted-average argument was the first indication that it might be possible to reduce 
the Kepler conjecture to a problem in a finite number of variables. Needless to say, 
calculations involving the weighted averages of the volumes of several Voronoi cells 
will be significantly more difficult than those involved in establishing the dodecahedral 
conjecture. 

To justify his approach, which limits the number of Voronoi cells to thirteen, L. Fejes 
Toth needs a preliminary estimate of how close a thirteenth sphere can come to a central 
sphere. It is at this point in his formulation of the Kepler conjecture that an explicit version 
of the Newton—Gregory problem is required. How close can thirteen spheres come to a 
central sphere, as measured by the sum of their distances from the central sphere? 
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L. Fejes Toth made another significant suggestion in [Fej7]. He was the first to suggest 
the use of computers in the Kepler conjecture. After describing his program, he mentions 
that his approach reduces the problem to an optimization problem in a finite number of 
variables. He suggests that computers might be used to approximate the minimum to 
this optimization problem. 

The most widely publicized attempt to prove the Kepler conjecture was that of Hsiang 
[Hs1]. (See also [Hs2], [Hs3], and [Hs5].) Hsiang’s approach can be viewed as a con- 
tinuation and extension of L. Fejes T6th’s program. Hsiang’s paper contains major gaps 
and errors [CHMS]. The mathematical arguments against his argument appear in my 
debate with him in the Mathematical Intelligencer [Ha3], [Hs4]. There are now many 
published sources that agree with the central claims of [Ha3] against Hsiang. Conway 
and Sloane report that the paper “contains serious flaws.” G. Fejes Téth feels that “the 
greater part of the work has yet to be done” [Fej1]. Bezdek concluded, after an extensive 
study of Hsiang’s work, “his work is far from being complete and correct in all de- 
tails” [Bez2]. Muder writes, “the community has reached a consensus on it: no one buys 
it” [Mu3]. 


2. Overview of the Proof 
2.1. Experiments with other Decompositions 


The following two sections (added January 2003) describe some of the motivation behind 
the partitions of space that have been used in the proof of the Kepler conjecture. This 
discussion includes various ideas that were tried, found wanting, and discarded. However, 
this discussion provides motivation for some of the choices that appear in the proof of 
the Kepler conjecture. 

Let S be a regular tetrahedron of side length 2. If we place a unit ball at each of the 
four vertices, the fraction of the tetrahedral solid occupied by the part of the four balls 
within the tetrahedron is dt. * 0.7797. Let O be a regular octahedron of side length 2. 
If we place a unit ball at each of the four vertices, the fraction of the octahedral solid 
occupied by the four balls is do¢¢ * 0.72. The fec packing can be obtained by packing 
eight regular tetrahedra and six regular octahedra around each vertex. The density 2 //18 
of this packing is a weighted average of dtet and Soct: 


ss 
V18 


My early conception (around 1989) was that for every packing of congruent balls, there 
should be a corresponding partition of space into regions of high density and regions 
of low density. Regions of high density should be defined as regions having density 
between doct and dtet, and regions of low density should be defined as those regions of 
density at most doct. It was my intention to prove that all regions of high density had to 
be confined to a set of nonoverlapping tetrahedra whose vertices are centers of the balls 
in the packing. 

Thus, the question naturally arises of how much a regular tetrahedron of edge length 
2 can be deformed before its density drops below that of a regular octahedron doct. 


1 2 
= 3 Otet = 3 Soct- 
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0.726 


0.724 


0.722 


0.7181 


Fig. 2.1. The origin of the constant 2.51. 


The graph in Fig. 2.1 shows the density of a tetrahedron with five edges of length 2 
and a sixth edge of length x. Numerically, we see that the density drops below doct, 
when x = x9 © 2.504. To achieve the design goal of confining regions of high den- 
sity to tetrahedra, we want a tetrahedron of edge lengths 2,2,2,2,2,x, for x < x0, 
to be counted as a region of high density. Rounding upward, this example led to 
the cutoff parameter of 2.51 that distinguishes the tetrahedra (in the high density re- 
gion) from the rest of space. This is the origin of the constant 2.51 that appears in 
the proof. 

Since the tetrahedra are chosen to have vertices at the centers of the balls in the packing, 
it was quite natural to base the decomposition of space on the Delaunay decomposition. 
According to this early conception, space was to be partitioned into Delaunay simplices. 
A Delaunay simplex whose edge lengths are at most 2.51 is called a quasi-regular 
tetrahedron. These were the regions of presumably high density. According to the strategy 
in those early days, all other Delaunay simplices were to be shown to belong to regions 
of density at most doct. 

The following problem occupied my attention for a long period. 


Problem. Fix a saturated packing. Let X(oct) be the part of space of a saturated 
packing that is occupied by the Delaunay simplices having at least one edge of length at 
least 2.51. Let X (tet) be the union of the complementary set of Delaunay simplices. Is 
it always true that the density of X (oct) is at most doct? 


Early on, I viewed the positive resolution of this problem as crucial to the solution of 
the Kepler conjecture. Eventually, when I divided the proof of the Kepler conjecture into 
a five-step program, a variant of this problem became the second step of the program. 
See [Ha7]. 

To give an indication of the complexity of this problem, consider the simplex with 


edge lengths 2,2, 2,2, £, 2, where € = ,/2(3+ 6) =~ 3.301. Assume that the two 
longer edges meet at a vertex. This simplex can appear as the Delaunay simplex in a 
saturated packing. Its density is about 0.78469. This constant is not only greater than 
Soct3 it is even greater than d,, so that the problem is completely misguided at the level 
of individual Delaunay simplices in X (oct). It is only when the union of Delaunay 
simplices is considered that we can hope for an affirmative answer to the problem. 


[f 
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By the summer of 1994, I had lost hope of finding a partition of the set X (oct) into 
small clusters of Delaunay simplices with the property that each cluster had density at 
most doct. Progress had ground to a halt. The key insight came in the fall of 1994 (on 
Nov. 12, 1994 to be precise). On that day, I introduced a hybrid decomposition that relied 
on the Delaunay simplices in the regions X (tet) formed by quasi-regular tetrahedra, but 
that switched to the Voronoi decomposition in certain regions of X (oct). By April 1995 I 
had reformulated the problem, worked out a proof of the problem [Ha7] in its new form, 
and submitted it for publication. I submitted a revised version of [Ha6] that same month. 
The revision mentions the new strategy: “The rough idea is to let the score of a simplex 
in a cluster be the compression I'(S) [a function based on the Delaunay decomposition] 
if the circumradius of every face of S small, and otherwise to let the score be defined by 
Voronoi cells (in a way that generalizes the definition for quasi-regular tetrahedra).” See 
p. 6 of [Ha6]. 

The situation is somewhat more complicated than the previous paragraph suggests. 
Consider a Delaunay simplex S$ with edge lengths 2, 2, 2,2, 2,2.52. Such a simplex 
belongs to the region X (oct). However, if we break it into four pieces according to 
the Voronoi decomposition, the density of two of the pieces is about 0.696 < doc 
and the density of the other two is about 0.7368 > doct. It is desirable not to have 
any separate regions in X (oct) of density greater than 5,. Hence it is preferable to 
keep the four Voronoi regions in S$ together as a single Delaunay simplex. A second 
reason to keep S together is that the proof of the local optimality of the fcc packing 
and hcp seems to require it. A third reason was to treat pentahedral prisms. (This is a 
thorny class of counterexamples to a pure Delaunay simplex approach to the proof of the 
Kepler conjecture. See [Hal], [Ha2], and [Fer].) For these reasons, we identify a class 
of Delaunay simplices in X (oct) (such as S) that are to be treated according to a special 
set of rules. They are called quarters. As the name suggests, they often occur as the four 
simplices comprising an octahedron that has been “quartered.” 

One of the great advantages of a hybrid approach is that there is a tremendous amount 
of flexibility in the choice of the details of the decomposition. The details of the decom- 
position continued to evolve during 1995 and 1996. Finally, during a stay in Budapest 
following the Second European Congress in 1996, I abandoned all vestiges of the Delau- 
nay decomposition, and adopted definitions of quasi-regular tetrahedra and quarters that 
rely only on the metric properties of the simplices (as opposed to the Delaunay criterion 
based on the position of other sphere centers in relation to the circumscribing sphere of 
the simplex). This decomposition of space is essentially what is used in the final proof. 

The hybrid construction depends on certain choices of functions (satisfying a rather 
mild set of constraints). To solve the Kepler conjecture appropriate functions had to 
be selected, and an optimization problem based on those functions had to be solved. 
This function is called the score. Samuel Ferguson and I realized that every time we 
encountered difficulties in solving the minimization problem, we could adjust the scoring 
function o to skirt the difficulty. The function o became more complicated, but with 
each change we cut months—or even years—from our work. This incessant fiddling 
was unpopular with my colleagues. Every time I presented my work in progress at a 
conference, I was minimizing a different function. Even worse, the function was mildly 
incompatible with what I did in earlier papers [Ha6], [Ha7], and this required going back 
and patching the earlier papers. 
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The definition of the scoring function o did not become fixed until it came time for 
Ferguson to defend his thesis, and we finally felt obligated to stop tampering with it. 
The final version of the scoring function o is rather complicated. The reasons for the 
precise form of o cannot be described without a long and detailed description of dozens 
of sphere clusters that were studied in great detail during the design of this function. 
However, a few general design principles can be mentioned. These comments assume a 
certain familiarity with the design of the proof. 


(1) Simplices (with vertices at the centers of the balls in the packing) should be used 
whenever careful estimates of the density are required. Voronoi cells should be 
used whenever crude estimates suffice. For Voronoi cells, it is clear what the 
scoring function should be vor(R) (and its truncated versions vorg(R), and so 
forth). 

(2) The definition of the scoring function for quasi-regular tetrahedra was fixed in 
[Ha6] and this definition had to remain fixed to avoid rewriting that long paper. 


Because of these first two points, most of the design effort for the function o was 
focused on quarters. 


(3) The decision to make the scoring for a quarter change when the circumradius of 
a face reaches ./2 is to make the proof of the local optimality of the fec and hep 
run smoothly. From [Ha7], we see that the cutoff value /2 is important for the 
success of that proof. The cutoff ./2 is also important for the proof that standard 
regions (other than quasi-regular tetrahedra) score at most 0 pt. 

(4) The purpose of adding terms to the scoring function o that depend on the truncated 
Voronoi function vorg is to make interval arithmetic comparisons between o 
and vorg easier to carry out. This is useful in arguments about “erasing upright 
quarters.” 


2.2. Contents of the Papers 


In [Ha6] a five-step program was described to prove the Kepler conjecture. It was planned 
that there would be five papers, each proving one step in the program. The papers [Ha6] 
and [Ha7] carry out the first two steps in the program. Because of the changes in the 
scoring function, it was necessary to issue a short paper [FH] mid-stream whose purpose 
was to give some adjustments to the five-step program. This paper adjusts the definitions 
from [Ha6] and checks that none of the results from [Ha6] and [Ha7] are affected in an 
essential way by these changes. Following this, the papers [Ha9] and [Fer] appeared in 
preprint form, completing the third and fifth steps of the program. The fourth step turned 
out to be particularly difficult. It occupies two separate papers [Hal0] and [Ha11]. 

The original series of papers suffers from the defect of being written over a span 
of several years. Some shifts in the conceptual framework of the research are evident. 
Based on comments from referees, a revision of these papers was prepared in 2002. The 
revisions were small, except for the paper [Hal 1], which was completely rewritten. The 
structure of the proof remains the same, but it adds a substantial amount of introductory 
material that lessens the dependence on [Ha6] and [Ha7]. 
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The papers were reorganized again in 2003. The series of papers is no longer orga- 
nized along the original five steps with a mid-stream correction. Instead, the proof is now 
arranged according to the logical development of the subject matter. Only minor mod- 
ifications have been made to the original proof. (The earlier versions are still available 
from [arXiv].) In the 2003 revision the exposition of the proof is entirely independent of 
the earlier papers [Ha6] and [Ha7]. 

An introduction to the ideas of the proof can be found in [Hal2]. An introduction to 
the algorithms can be found in [Hal4]. Speculation on a second-generation design of a 
proof can be found in [Hal4] and [Ha13]. 


2.3. Complexity 


Why is this a difficult problem? There are many ways to answer this question. 

This is an optimization problem in an infinite number of variables. In many respects 
the central problem has been to formulate a good finite-dimensional approximation to the 
density of a packing. Beyond this, there remains an extremely difficult problem in global 
optimization, involving nearly 150 variables. We recall that even very simple classes of 
nonlinear optimization problems, such as quadratic optimization problems, are NP-hard 
[HPT]. A general highly nonlinear program of this size is regarded by most researchers 
as hopeless (at least as far as rigorous methods are concerned). 

There is a considerable literature on many closely related nonlinear optimization 
problems (the Tammes problem, circle packings, covering problems, the Lennard—Jones 
potential, Coulombic energy minimization of point particles, and so forth). Many of our 
expectations about nonlattice packings are formed by the extensive experimental data 
that have been published on these problems. The literature leads one to expect a rich 
abundance of critical points, and yet it leaves one with a certain skepticism about the 
possibility of establishing general results rigorously. 

The extensive survey of circle packings in [Me] gives a broad overview of the progress 
and limits of the subject. Problems involving a few circles can be trivial to solve. Problems 
involving several circles in the plane can be solved with sufficient ingenuity. With the aid 
of computers, various problems involving a few more circles can be treated by rigorous 
methods. Beyond that, numerical methods give approximations but no rigorous solutions. 
Melissen’s account of the 20-year quest for the best separated arrangement of ten points 
in a unit square is particularly revealing of the complexities of the subject. 

Kepler’s problem has a particularly rich collection of (numerical) local maxima that 
come uncomfortably close to the global maximum [Hal]. These local maxima explain in 
part why a large number (around 5000) of planar maps are generated as part of the proof 
of the conjecture. Each planar map leads to a separate nonlinear optimization problem. 


2.4. Computers 
As this project has progressed, the computer has replaced conventional mathematical 


arguments more and more, until now nearly every aspect of the proof relies on computer 
verifications. Many assertions in these papers are results of computer calculations. To 
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make the proof of Kepler’s conjecture more accessible, I have posted extensive resources 
[arXiv]. 

Computers are used in various significant ways. They will be mentioned briefly here, 
and then developed more thoroughly elsewhere in the collection, especially in the final 


paper. 


1. Proof of inequalities by interval arithmetic. “Sphere Packings, I” describes a 
method of proving various inequalities in a small number of variables by computer 
by interval arithmetic. 

2. Combinatorics. A computer program classifies all of the planar maps that are 
relevant to the Kepler conjecture. 

3. Linear programming bounds. Many of the nonlinear optimization problems for the 
scores of decomposition stars are replaced by linear problems that dominate the 
original score. They are solved by linear programming methods by computer. A 
typical problem has between 100 and 200 variables and 1000 and 2000 constraints. 
Nearly 100,000 such problems enter into the proof. 

4. Branch and bound methods. When linear programming methods do not give suf- 
ficiently good bounds, they have been combined with branch and bound methods 
from global optimization. 

5. Numerical optimization. The exploration of the problem has been substantially 
aided by nonlinear optimization and symbolic mathematical packages. 

6. Organization of output. The organization of the few gigabytes of code and data 
that enter into the proof is in itself a nontrivial undertaking. 
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Abstract. This paper is the second in a series of six papers devoted to the proof of the 
Kepler conjecture, which asserts that no packing of congruent balls in three dimensions has 
density greater than the face-centered cubic packing. The top level structure of the proof 
is described. A compact topological space is described. Each point of this space can be 
described as a finite cluster of balls with additional combinatorial markings. A continuous 
function on this compact space is defined. It is proved that the Kepler conjecture will follow 
if the value of this function is never greater than a given explicit constant. 


Introduction 


The following papers give a proof of the Kepler conjecture, which asserts that no packing 
of congruent balls in three-dimensional Euclidean space has density exceeding that of 
the face-centered cubic (fcc) packing. 

A historical overview of the Kepler conjecture is found in the first paper in this series. 
Since the history of this problem is treated there, this paper does not go into the details 
of the extensive literature on this problem. We mention that Hilbert included the Kepler 
conjecture as part of his eighteenth problem [Hi]. L. Fejes Toth was the first to formulate 
a plausible strategy for a proof [Fej8]. He also suggested that computers might play a role 
in the solution of this problem. The historical account also discusses the development of 
some of the key concepts of this paper. 

An expository account of the proof is contained in [Hal2]. A general reference on 
sphere packings is [CS2]. A general discussion of the computer algorithms that are used 
in the proof can be found in [Hal4]. Some speculations on the structure of a second- 
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generation proof can be found in [Ha13]. Details of computer calculations can be found 
on the internet at [Hal6]. 

The first section of this paper gives the top level structure of the proof of the Kepler 
conjecture. The next two sections describe the fundamental decompositions of space 
that are needed in the proof. The first decomposition, which is called the Q-system, is a 
collection of simplices that do not overlap. This decomposition was originally inspired 
by the Delaunay decomposition of space. The other decomposition, which is called the 
V-cell decomposition, is closely related to the Voronoi decomposition of space. In the 
following section these two decompositions of space are combined into geometrical ob- 
jects called decomposition stars. The decomposition star is the fundamental geometrical 
object in the proof of the Kepler conjecture. 

The final section of this paper, which was coauthored with Samuel P. Ferguson, 
describes a particular nonlinear function on the set of all decomposition stars, called the 
scoring function. The Kepler conjecture reduces to an optimization problem involving 
this nonlinear function on the set of all decomposition stars. This is an optimization 
problem in a finite number of variables. The subsequent papers (Papers I-VI) solve 
that optimization problem. 

The choice of the particular scoring function to use was arrived at jointly with 
Samuel P. Ferguson. He has contributed to this project in many important ways, in- 
cluding the results in Section 7. 

Some history of the proof and this paper is as follows. The original proof, as envisioned 
in 1994 and accomplished in 1998, was divided into a five-step program. As a result, the 
original papers were called “Sphere Packings, I,” “Sphere Packings, II,” and so forth. 
The first two papers in the series were published in an earlier volume of DCG. As it 
turned out, the fourth step “Sphere Packings, IV” is considerably more difficult than the 
other steps in the program. It became clear that a single paper would not suffice, and the 
fourth step of the proof was divided into two parts “Sphere Packings, IV” and “Kepler 
Conjecture (Sphere Packings, VI).” Samuel Ferguson’s thesis “Sphere Packings, V” 
solved one of the five major steps in the proof. (Although “Sphere Packings, IV” and 
“Sphere Packings, VI’ belonged together, because of the numbering scheme, Ferguson’s 
theses “Sphere Packings, V” was inserted between these two papers.) 

The proof that is contained in this volume is a rewritten version of the proof. For his- 
torical reasons, the papers in this volume have retained the original titles, but because of 
extensive revisions over the past several years, the proof is no longer arranged according 
to the five steps of the 1994 program. 

In addition to the 5 + | papers corresponding to the five steps of the original program, 
there is the current paper. It has the following origin. In 1996 it became clear that progress 
on the problem required some adjustments in the main nonlinear optimization problem of 
“Sphere Packings, I” and “II.” As the original 1996 manuscript put it, “There are infinitely 
many scoring schemes that should lead to a proof of the Kepler conjecture. The problem 
is to formulate the scheme that makes the Kepler conjecture as accessible as possible” 
[Ha5]. The original purpose of this paper was to make some useful improvements in the 
scoring function from “Sphere Packings, I’ and “IT’ and to make the changes in such a 
way that the main results of those papers would still hold true. 

Over the past years, this paper has grown considerably in scope to the point that it is 
now lays the foundation for all of the papers in the series. In fact, all of the foundational 
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material from “Sphere Packings, I’ and “II”’ and the 1998 preprint series has been 
collected together in this article. The scoring function is no longer the same as the one 
presented in “Sphere Packings, I” and “II.” This paper adapts the relevant material from 
these earlier papers to the current scoring function. This paper has expanded to the point 
that it is now possible to understand the entire proof of the Kepler conjecture without 
reading “T’ and “II.” 


3. The Top-Level Structure of the Proof 


This section describes the structure of the proof of the Kepler conjecture. 


3.1. Statement of Theorems 


Theorem 3.1 (The Kepler Conjecture). No packing of congruent balls in Euclidean 
three space has density greater than that of the face-centered cubic packing (see Fig.3.1). 


This density is 7 //18 © 0.74. 

The proof of this result is presented in this paper. Here, we describe the top-level 
outline of the proof and give references to the sources of the details of the proof. 

By a packing, we mean an arrangement of congruent balls that are nonoverlapping 
in the sense that the interiors of the balls are pairwise disjoint. Consider a packing of 
congruent balls in Euclidean three space. There is no harm in assuming that all the balls 
have unit radius. The density of a packing does not decrease when balls are added to 
the packing. Thus, to answer a question about the greatest possible density we may add 
nonoverlapping balls until there is no room to add further balls. Such a packing is said 
to be saturated. 

Let A be the set of centers of the balls in a saturated packing. Our choice of radius for 
the balls implies that any two points in A have distance at least 2 from each other. We call 
the points of A vertices. Let B(x, r) denote the closed ball in Euclidean three space at 
center x and radius r. Let 6(x, r, A) be the finite density, defined as the ratio of the volume 
of B(x,r, A) to the volume of B(x, 7), where B(x, r, A) is defined as the intersection 
with B(x, r) of the union of all balls in the packing. Set A(x, r) = AN B(x,r). 


Fig. 3.1. The fec packing. 
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Recall that the Voronoi cell Q(v) = Q(v, A) around a vertex v € A is the set of 
points closer to v than to any other ball center. The volume of each Voronoi cell in the 
fee packing is 32. This is also the volume of each Voronoi cell in the hexagonal-close 
packing (hcp). 


Definition 3.2. Let A: A — R bea function. We say that A is negligible if there is a 
constant C, such that for all r > 1 and all x € R3, 


Y> AQ) < Cir? 
veA(x,r) 
We say that the function A: A — R is fcc-compatible if for all v € A we have the 
inequality 


V/32 < vol(Q(v)) + A(v). 


The value vol({2(v)) + A(v) may be interpreted as a corrected volume of the Voronoi 
cell. Fec-compatibility asserts that the corrected volume of the Voronoi cell is always at 
least the volume of the Voronoi cells in the fec and hcp. 


Lemma 3.3. /f there exists a negligible fcc-compatible function A: A — R for a 
saturated packing A, then there exists a constant C such that for all r > 1 and all 
xeR, 

b(x,r, A) < a/V18+C/r. 


The constant C depends on A only through the constant C,. 


Proof. The numerator vol B(x, r, A) of 6(x, r, A) is at most the product of the volume 
of a ball 42/3 with the number |A(x, 7 + 1)| of balls intersecting B(x, r). Hence 


vol B(x, r, A) < |A(x, r+ 1)|427/3. (3.1) 


In a saturated packing each Voronoi cell is contained in a ball of radius 2 centered at 
the center of the cell. The volume of the ball B(x, r +3) is at least the combined volume 
of Voronoi cells whose center lies in the ball B(x, r + 1). This observation, combined 
with fcc-compatibility and negligibility, gives 


V32\A@,r+ D1 < DS) (Av) + volK(QQ@))) 
ve A(x,r+l) 
< C,(r +1)? + vol B(x, r + 3) 
< Cir +1)? +. 4+3/r)*vol B(x, r). (3.2) 


Recall that (x, r, A) = vol B(x, r, A)/vol B(x, r). Divide inequality (3.1) through by 
vol B(x, r). Use inequality (3.2) to eliminate | A(x, r + 1)| from the resulting inequality. 


This gives 
1 ay" (r +1)? 
sin ays = (142) +C ; 
VIB r "73/32 


The result follows for an appropriately chosen constant C. 
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An analysis of the preceding proof shows that fec-compatibility leads to the particular 
value z/./18 in the statement of Lemma 3.3. If fec-compatibility were to be dropped 
from the hypotheses, any negligible function A would still lead to an upper bound 
4m /(3L) on the density of a packing, expressed as a function of a lower bound L on all 
vol Q(v) + A(v). 


Remark 3.4. We take the precise meaning of the Kepler conjecture to be a bound 
on the essential supremum of the function 6(x, 7, A) as r tends to infinity. Lemma 3.3 
implies that the essential supremum of 5(x, r, A) is bounded above by 7//18, provided 
a negligible fcc-compatible function can be found. The strategy will be to define a 
negligible function, and then to solve an optimization problem in finitely many variables 
to establish that it is fec-compatible. 


Section 6 defines a compact topological space DS (the space of decomposition stars, 
Definition 6.2) and a continuous function o on that space, which is directly related to 
packings. 

If A is a saturated packing, then there is a geometric object D(v, A) constructed 
around each vertex v € A. D(v, A) depends on A only through the vertices in A that are 
at most a constant distance away from v. That constant is independent of v and A. The 
objects D(v, A) are called decomposition stars, and the space of all decomposition stars 
is precisely DS. Section 6.2 shows that the data in a decomposition star are sufficient 
to determine a Voronoi cell (2(D) for each D € DS. The same section shows that the 
Voronoi cell attached to D is related to the Voronoi cell of v in the packing by relation 


vol Q(v) = vol Q(D(v, A)). 


Section 7 defines a continuous real-valued function Ap: DS — R that assigns a “weight” 
to each decomposition star. The topological space DS embeds into a finite-dimensional 
Euclidean space. The reduction from an infinite-dimensional to a finite-dimensional 
problem is accomplished by the following results. 


Theorem 3.5. For each saturated packing A,andeachv € A, there is adecomposition 
star D(v, A) € DS such that the function A: A — R defined by 


A(v) = Ao(D(v, A)) 
is negligible for A. 


This is proved as Theorem 7.11. The main object of the proof is then to show that 
the function A is fec-compatible. This is implied by the inequality (in a finite number of 
variables) 


/32 < vol Q(D) + Ao(D), (3.3) 


for all D € DS. 

In the proof it is convenient to reframe this optimization problem by composing it with 
a linear function. The resulting continuous function 0: DS — R is called the scoring 
function, or score. 
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Let dt be the packing density of a regular tetrahedron. That is, let S be a regular 
tetrahedron of edge length 2. Let B be the part of S that lies within distance 1 of 
some vertex. Then dj is the ratio of the volume of B to the volume of S. We have 
Stet = V8 arctan(./2/5). 

Let doct be the packing density of a regular octahedron of edge length 2, again con- 
structed as the ratio of the volume of points within distance | of a vertex to the volume 
of the octahedron. 

The density of the fcc packing is a weighted average of these two ratios 


Tu Stet + 2WSoct 
/18 3 3 
This determines the exact value of 5, in terms of die. We have doc & 0.72. 
In terms of these quantities, 


16x 
0 (D) = —44oct(vol(Q2(D)) + Ao(D)) + a (3.4) 


Definition 3.6. We define the constant 


J2 a 
pt = 4arctan | — ]— ~. 
5 3 


Its value is approximately pt ~ 0.05537. Equivalent expressions for pf are 
a 4 
pt = V2 — 5 = 2 (25.c1 2 =) 


In terms of the scoring function o, the optimization problem in a finite number of 
variables (inequality (3.3)) takes the following form. The proof of this inequality is a 
central concern in this paper. 


Theorem 3.7 (Finite-Dimensional Reduction). The maximum of o on the topological 
space DS of all decomposition stars is the constant 8 pt © 0.442989. 


Remark 3.8. The Kepler conjecture is an optimization problem in an infinite number 
of variables (the coordinates of the points of A). The maximization of o on DS is an 
optimization problem in a finite number of variables. Theorem 3.7 may be viewed as a 
finite-dimensional reduction of the Kepler conjecture. 


Let f9 = 1.255 (2t9 = 2.51). This is a parameter that is used for truncation throughout 
this paper. 

Let U(v, A) be the set of vertices in A at nonzero distance at most 2f9 from v. From 
v and a decomposition star D(v, A) it is possible to recover U(v, A), which we write as 
U(D). We can completely characterize the decomposition stars at which the maximum 
of o is attained. 


90 


A Formulation of the Kepler Conjecture 27 


Theorem 3.9. Let D be a decomposition star at which the functiona: DS — R attains 
its maximum. Then the set U(D) of vectors at distance at most 2ty from the center has 
cardinality twelve. Up to Euclidean motion, U (D) is one of two arrangements: the kissing 
arrangement of the twelve balls around a central ball in the fcc packing or the kissing 
arrangement of twelve balls in the hcp. 


There is a complete description of all packings in which every sphere center is sur- 
rounded by twelve others in various combinations of these two patterns. All such pack- 
ings are built from parallel layers of the Az lattice. (The A2 lattice formed by equilateral 
triangles, is the optimal packing in two dimensions.) See the first paper I. 


3.2. Basic Concepts in the Proof 


To prove Theorems 3.1, 3.7, and 3.9, we wish to show that there is no counterexample. In 
particular, we wish to show that there is no decomposition star D with value 0 (D) > 8 pt. 
We reason by contradiction, assuming the existence of such a decomposition star. With 
this in mind, we call D a contravening decomposition star if 


o(D) > 8 pt. 


In much of what follows we tacitly assume that every decomposition star under discussion 
is a contravening one. Thus, when we say that no decomposition stars exist with a given 
property, it should be interpreted as saying that no such contravening decomposition 
stars exist. 

To each contravening decomposition star D, we associate a (combinatorial) plane 
graph G(D). A restrictive list of properties of plane graphs is described in Section 18.3. 
Any plane graph satisfying these properties is said to be tame. All tame plane graphs 
have been classified. There are several thousand, up to isomorphism. The list appears in 
[Hal6]. We refer to this list as the archival list of plane graphs. 

A few of the tame plane graphs are of particular interest. Every decomposition star 
attached to the fcc packing gives the same plane graph (up to isomorphism). Call it Ggcc. 
Likewise, every decomposition star attached to the hcp gives the same plane graph Gyep 
(see Fig. 3.2). 


Fig. 3.2. The plane graphs Gfec and Ghep. 
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Fig. 3.3. The plane graph Gpent of the pentahedral prism. 


There is one more tame plane graph that is particularly troublesome. It is the graph 
Gpent obtained from the pictured configuration of twelve balls tangent to a given central 
ball (Fig. 3.3). (Place a ball at the north pole, another at the south pole, and then form two 
pentagonal rings of five balls.) This case requires individualized attention. S. Ferguson 
proves the following theorem in Paper V. 


Theorem 3.10 (Ferguson). There are no contravening decomposition stars D whose 
associated plane graph is isomorphic to Gpent. 


3.3. Logical Skeleton of the Proof 


Consider the following six claims. Eventually we will give a proof of all six statements. 
First, we draw out some of their consequences. The main results (Theorems 3.1, 3.7, and 
3.9) all follow from these claims. 


Claim 3.11. [f the maximum of the function o on DS is 8 pt, then for every saturated 
packing A there exists a negligible fcc-compatible function A. 


Claim 3.12. Let D be a contravening decomposition star. Then its plane graph G(D) 
is tame. 


Claim 3.13. [faplane graph is tame, then it is isomorphic to one of the several thousand 
plane graphs that appear in the archival list of plane graphs. 


Claim 3.14. [f the plane graph of a contravening decomposition star is isomorphic to 
one in the archival list of plane graphs, then it is isomorphic to one of the following three 
plane graphs: Gyen, Gnep, OF Gfcc- 


Claim 3.15. There do not exist any contravening decomposition stars D whose asso- 
ciated graph is isomorphic to Gpent- 
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Claim 3.16. Contravening decomposition stars exist. If D is a contravening decompo- 
sition star, and if the plane graph of D is isomorphic to Gje¢ Or Ghcp, then o (D) = 8 pt. 
Moreover, up to Euclidean motion, U (D) is the kissing arrangement of the twelve balls 
around a central ball in the fcc packing or the kissing arrangement of twelve balls in 
the hcp. 


Next, we state some of the consequences of these claims. 


Lemma 3.17. Assume Claims 3.12-3.15. If D is a contravening decomposition star, 
then its plane graph G(D) is isomorphic to Ghep or Gee. 


Proof. Assume that D is a contravening decomposition star. Then its plane graph is 
tame, and consequently appears on the archival list of plane graphs. Thus, it must be 
isomorphic to one of Gfec, Gnep, OF Gpent- The final graph is ruled out by Claim 3.15. 


Lemma 3.18. Assume Claims 3.12—3.16. Then Theorem 3.7 holds. 


Proof. By Claim 3.16 and Lemma 3.17, the value 8 pf lies in the range of the function 
o on DS. Assume for a contradiction that there exists a decomposition star D € DS that 
has o(D) > 8 pt. By definition, this is a contravening star. By Lemma 3.17, its plane 
graph is isomorphic to Gyep or Gfcc. By Claim 3.16, o(D) = 8 pt, in contradiction with 
o(D) > 8 pt. 


Lemma 3.19. Assume Claims 3.12—3.16. Then Theorem 3.9 holds. 


Proof. By Theorem 3.7, the maximum of o on DS is 8 pt. Let D be a decomposition 
star at which the maximum 8 pf is attained. Then D is a contravening star. Lemma 3.17 
implies that the plane graph is isomorphic to Gycp or Gyee. The hypotheses of Claim 3.16 
are satisfied. The conclusion of Claim 3.16 is the conclusion of Theorem 3.9. 


Lemma 3.20. Assume Claims 3.11-3.16. Then the Kepler conjecture (Theorem 3.1) 
holds. 


Proof. As pointed out in Remark 3.4, the precise meaning of the Kepler conjecture is 
for every saturated packing A, the essential supremum of 5(x, r, A) is at most 2/18. 

Let A be the set of centers of a saturated packing. Let A: A — R be the negligible, 
fcc-compatible function provided by Claim 3.11 (and Lemma 3.18). By Lemma 3.3, the 
function A leads to a constant C such that for all r > 1 and all x € R°, the density 
5(x,r, A) satisfies 


d(x,r, A) < w/V 184+ C/r. 
This implies that the essential supremum of 5(x, 7, A) is at most 27/18. 


Remark 3.21. One other theorem (Theorem 3.5) was stated without proof in Sec- 
tion 3.1. This result was placed there to motivate the other results. However, it is not an 
immediate consequence of Claims 3.11—3.16. Its proof appears in Theorem 7.11. 
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3.4. Proofs of the Central Claims 


The previous section showed that the main results in Section 3.1 (Theorems 3.1, 3.7, and 
3.9) follow from six claims. This section indicates where each of these claims is proved, 
and mentions a few facts about the proofs. 

Claim 3.11 is proved in Theorem 7.14. Claim 3.12 is proved in Theorem 20.20. 
Claim 3.13, the classification of tame graphs, is proved in Theorem 19.1. By the classi- 
fication of such graphs, this reduces the proof of the Kepler conjecture to the analysis of 
the decomposition stars attached to the finite explicit list of tame plane graphs. We will 
return to Claim 3.14 ina moment. Claim 3.15 is Ferguson’s thesis, cited as Theorem 3.10. 

Claim 3.16 is the local optimality of the fcc and hcp. In Section 8, the necessary local 
analysis is carried out to prove Claim 3.16 as Corollary 8.3. 

Now we return to Claim 3.14. This claim is proved as Theorem 23.1. The idea of the 
proof is the following. Let D be a contravening decomposition star with graph G(D). 
We assume that the graph G(D) is not isomorphic to Gfec, Ghcp, and Gpent and then 
prove that D is not contravening. This is a case-by-case argument, based on the explicit 
archival list of plane graphs. 

To eliminate these remaining cases, more-or-less generic arguments can be used. A 
linear program is attached to each tame graph G. The linear program can be viewed 
as a linear relaxation of the nonlinear optimization problem of maximizing o over all 
decomposition stars with a given tame graph G. Because it is obtained by relaxing 
the constraints on the nonlinear problem, the maximum of the linear problem is an 
upper bound on the maximum of the original nonlinear problem. Whenever the linear 
programming maximum is less than 8 pt, it can be concluded that there is no contravening 
decomposition star with the given tame graph G. This linear programming approach 
eliminates most tame graphs. 

When a single linear program fails to give the desired bound, it is broken into a series 
of linear programming bounds, by branch and bound techniques. For every tame plane 
graph G other than Ghep, Grec, and Gpent, We produce a series of linear programs that 
establish that there is no contravening decomposition star with graph G. 

The proof is organized in the following way. Sections 4—7 introduce the basic defini- 
tions. Section 7 gives a proof of Claim 3.11. Section 8 proves Claim 3.16. Sections 9-14 
present the fundamental estimates. Sections 18-19 give a proof of Claim 3.13. Sec- 
tions 20-22 give a proof of Claim 3.12. Sections 23-25 give a proof of Claim 3.14. 
Claim 3.15 (Ferguson’s thesis) appears in Paper V. 


4. Construction of the Q-System 


It is useful to separate the parts of space of relatively high packing density from the 
parts of space with relatively low packing density. The Q-system, which is developed 
in this section, is a crude way of marking off the parts of space where the density is 
potentially high. The Q-system is a collection of simplices whose vertices are points 
of the packing A. The Q-system is reminiscent of the Delaunay decomposition, in the 
sense of being a collection of simplices with vertices in A. In fact, the Q-system is the 
remnant of an earlier approach to the Kepler conjecture that was based entirely on the 
Delaunay decomposition (see [Ha2]). However, the Q-system differs from the Delaunay 
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decomposition in crucial respects. The most fundamental difference is that the Q-system, 
while consisting of nonoverlapping simplices, does not partition all of space. 

This section defines the set of simplices in the Q-system and proves that they do 
not overlap. In order to prove this, we develop a long series of lemmas that study the 
geometry of intersections of various edges and simplices. At the end of this section we 
give the proof that the simplices in the Q-system do not overlap. 


4.1. Description of the Q-system 


Fix a packing of balls of radius 1. We identify the packing with the set A of its centers. 
A packing is thus a subset A of R? such that for all v, w € A, |v — w| < 2 implies 
v = w. The centers of the balls are called vertices. The term “vertex” will be reserved 
for this technical usage. A packing is said to be saturated if for every x € R’, there is 
some v € A such that |x — v| < 2. Any packing is a subset of a saturated packing. We 
assume that A is saturated. The set A is countably infinite. 


Definition 4.1. We define the truncation parameter to be the constant f9 = 1.255. It is 
used throughout. Informal arguments that led to this choice of constant are described in 
Paper I. 


Precise constructions that rely on the truncation parameter f will appear below. We 
will regularly intersect Voronoi cells with balls of radius fo to obtain lower bounds on 
their volumes. We will regularly disregard vertices of the packing that lie at distance 
greater than 2/9 from a fixed v € A to obtain a finite subset of A (a finite cluster of balls 
in the packing) that is easier to analyze than the full packing A. 

The truncation parameter is the first of many decimal constants that appear. Each 
decimal constant is an exact rational value, e.g., 27 = 251/100. They are not to be 
regarded as approximations of some other value. 


Definition 4.2. A quasi-regular triangle is a set T C A of three vertices such that if 
v,w €T then |w — v| < 2%. 


Definition 4.3. A simplex is a set of four vertices. A quasi-regular tetrahedron is a 
simplex S such that if v, w € S then |w — v| < 2%. A quarter is a simplex whose edge 
lengths y;,..., ye can be ordered to satisfy 2t9 < y; < «f 3,2 <y; <2t0,i =2,...,6. 
If a quarter satisfies the strict inequalities 2%) < y; < /8, then we say that it is a strict 
quarter. We call the longest edge {v, w} of a quarter its diagonal. When the quarter is 
strict, we also say that its diagonal is strict. When the quarter has a distinguished vertex, 
the quarter is upright if the distinguished vertex is an endpoint of the diagonal, and flat 
otherwise. 


At times, we identify a simplex with its convex hull. We say, for example, that the 
circumcenter of a simplex is contained in the simplex to mean that the circumcenter 
is contained in the convex hull of the four vertices. Similar remarks apply to triangles, 
quasi-regular tetrahedra, quarters, and so forth. We write || for the convex hull of S 
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when we wish to be explicit about the distinction between |S| and its set of extreme 


points. 

When we wish to give an order on an edge, triangle, simplex, etc., we present the 
object as an ordered tuple rather than a set. Thus, we refer to both (vj,..., v4) and 
{v,,..., v4} as simplices, depending on the needs of the given context. 


Definition 4.4. Two manifolds with boundary overlap if their interiors intersect. 


Definition 4.5. A set O of six vertices is called a quartered octahedron if there are four 
pairwise nonoverlapping strict quarters S|,..., S4 all having the same diagonal, such 
that O is the union of the four sets S; of four vertices. (It follows easily that the strict 
quarters S; can be given a cyclic order with respect to which each strict quarter S; has 
a face in common with the next, so that a quartered octahedron is literally a octahedron 
that has been partitioned into four quarters.) 


Remark 4.6. A quartered octahedron may have more than one diagonal of length less 
than /8, so its decomposition into four strict quarters need not be unique. The choice 
of diagonal has no particular importance. Nevertheless, to make things canonical, we 
pick the diagonal of length less than /8 with an endpoint of smallest possible value with 
respect to the lexicographical ordering on coordinates; that is, with respect to the ordering 
(v1, y2, Y3) < (Yj. Yas ¥3), ify; = yj, fori = 1,...,k, and yer < yy 41. This selection 
rule for diagonals is fully translation invariant in the sense that if one octahedron is a 
translate of another (whether or not they belong to the same saturated packing), then the 
selected diagonal of one is a translate of the selected diagonal of the other. 


Definition 4.7. If {v,, v2} is an edge of length between 2%) and /8, we say that a vertex 
v (4 vq, v2) is an anchor of {v1, v2} if its distances to v; and v2 are at most 2%. 


The two vertices of a quarter that are not on the diagonal are anchors of the diagonal, 
and the diagonal may have other anchors as well. 


Definition 4.8. Let Q be the set of quasi-regular tetrahedra and strict quarters, enu- 
merated as follows. This set is called the Q-system. It is canonically associated with a 
saturated packing A. (The Q stands for quarters and quasi-regular tetrahedra.) 


1. All quasi-regular tetrahedra. 

2. Every strict quarter such that none of the quarters along its diagonal overlaps any 
other quasi-regular tetrahedron or strict quarter. 

3. Every strict quarter whose diagonal has four or more anchors, as long as there are 
not exactly four anchors arranged as a quartered octahedron. 

4. The fixed choice of four strict quarters in each quartered octahedron. 

5. Every strict quarter {v;, v2, v3, v4} whose diagonal {v,, v3} has exactly three an- 
chors v2, v4, vs provided that the following hold (for some choice of indexing). 
(a) {v2, vs} is a strict diagonal with exactly three anchors: v1, v3, v4. 

(b) dog + dos > , where dq is the dihedral angle of the simplex {v1, v3, v2, v4} 
along the edge {v;, v3} and ds is the dihedral angle of the simplex {v 1, v3, v2, vs} 
along the edge {v,, v3}. 
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No other quasi-regular tetrahedra or strict quarters are included in the Q-system Q. 
The following theorem is the main result of this section. 


Theorem 4.9. For every saturated packing, there exists a uniquely determined Q- 
system. Distinct simplices in the Q-system have disjoint interiors. 


While proving the theorem, we give a complete classification of the various ways in 
which one quasi-regular tetrahedron or strict quarter can overlap another. 

Having completed our primary purpose of showing that the simplices in the Q-system 
do not overlap, we state the following small lemma. It is an immediate consequence of 
the definitions, but is nonetheless useful in the sections that follow. 


Lemma 4.10. Jf one quarter along a diagonal lies in the Q-system, then all quarters 
along the diagonal lie in the Q-system. 


Proof. This is true by construction. Each of the defining properties of a quarter in the 
Q-system is true for one quarter along a diagonal if and only if it is true of all quarters 
along the diagonal. 


4.2. Geometric Considerations 


Remark 4.11. The primary definitions and constructions of this paper are translation 
invariant. Thatis, if A € R? and A isa saturated packing, then A-+-A is a saturated packing. 
If A: A > Ris a negligible fcc-compatible function for A, then A + vb A(v) isa 
negligible fec-compatible function for A + A. If Q is the Q-system of A, then A + QO 
is the Q-system of A + A. Because of general translational invariance, when we fix our 
attention on a particular v € A, we often assume (without loss of generality) that the 
coordinate system is fixed in such a way that v lies at the origin. 


Our simplices are generally assumed to come labeled with a distinguished vertex, 
fixed at the origin. (The origin will always be at a vertex of the packing.) We number 
the edges of each simplex 1, ..., 6, so that edges 1, 2, and 3 meet at the origin, and the 
edges i and i + 3 are opposite, fori = 1, 2, 3. (See Fig. 4.1.) S(y1, yo, ..., ye) denotes 
a simplex whose edges have lengths y;, indexed in this way. We refer to the endpoints 
away from the origin of the first, second, and third edges as the first, second, and third 
vertices. 


Definition 4.12. In general, let dih(S) be the dihedral angle of a simplex S along its first 
edge. When we write a simplex in terms of its vertices (w1, w2, w3, w4), then {w), w2} 
is understood to be the first edge. 


Definition 4.13. We define the radial projection of a set X to be the radial projection 
x ++ x/|x| of X\0 to the unit sphere centered at the origin. We say the two sets cross if 
their radial projections to the unit sphere overlap. 
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Fig. 4.1. € measures the distance between the vertices at 0 and v. The standard indexing of the edges of a 
simplex is marked on the lower simplex. 


Definition 4.14. If S and S’ are nonoverlapping simplices with a shared face F, we 
define €(.S, S’) as the distance between the two vertices (one on S and the other on S’) 
that do not lie on F. We may express this as a function 


E(S, S’) = E(S(1, +s Y6)s Ye Yo, y4) 


of nine variables, where S = S(y,,..., 6) and S’ = S(y}, y5, ¥4, ¥4, Ys, Yo), positioned 
so that S and S’ are nonoverlapping simplices with a shared face F of edge-lengths 
(4, Ys, Yo). The function of nine variables is defined only for values (y;, y;) for which 
the simplices S and S’ exist (Fig. 4.1). 


Several lemmas in this paper rely on calculations of lower bounds to the function € 
in the special case when the edge between the vertices 0 and v passes through the shared 
face F. If intervals containing y),..., 96, ¥}, ¥5, ¥4 are given, then lower bounds on € 
over that domain are generally easy to obtain. Detailed examples of calculations of the 
lower bound of this function can be found in Section 4 of [Ha6]. 

To work one example, we suppose we are asked to give a lower bound on € when the 
simplex S = S(y1,..., y6) Satisfies y; > 2 and y4, ys, yo < 2to and S’=S(y}, y5, 3, ya, 
ys, Yo) satisfies y; > 2, fori = 1,...,3. Assume that the edge {0, v} passes through 
the face shared between S and S’, and that |v] < /8, where v is the vertex of S’ 
that is not on S. We claim that any pair S, S’ can be deformed by moving one ver- 
tex at a time until S is deformed into S(2, 2,2, 2to, 2to, 2t9) and S’ is deformed into 
S(2, 2, 2, 2t9, 2t, 2t9). Moreover, these deformations preserve the constraints (includ- 
ing that {0, v} passes through the shared face), and are nonincreasing in |v|. From the 
existence of this deformation, it follows that the original |v| satisfies 


|v] = E(S(Q, 2, 2, 2t0, 2fo, 2t0), 2, 2, 2). 


We produce the deformation in this case as follows. We define the pivot of a vertex 
v with respect to two other vertices {v;, v2} as the circular motion of v held at a fixed 
distance from v; and v2, leaving all other vertices fixed. The axis of the pivot is the line 
through the two fixed vertices. Each pivot of a vertex can move in two directions. Let 
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the vertices of S be {0, v1, v2, v3}, labeled so that |v;| = y,;. Let S’ = {v, v1, v2, v3}. 
We pivot v; around the axis through 0 and v2. By choice of a suitable direction for the 
pivot, vj; moves away from v and v3. Its distance to 0 and v2 remains fixed. We continue 
with this circular motion until |v; — v3| achieves its upper bound or the segment {v, v3} 
intersects the segment {0, v} (which threatens the constraint that the segment {0, v} must 
pass through the common face). (We leave it as an exercise” to check that the second 
possibility cannot occur because of the edge length upper bounds on both diagonals of 
/8. That is, there does not exist a convex planar quadrilateral with sides at least 2 and 
diagonals less than V8.) Thus, |v; — v3| attains its constrained upper bound 2/9. Similar 
pivots to v2 and v3 increase the lengths |v; — v2, |v2 — v3|, and |v3 — v;| to 2%. Similarly, 
v may be pivoted around the axis through v,; and v2 to decrease the distance to v3 and 
0 until the lower bound of 2 on |v — v3| is attained. Further pivots reduce all remaining 
edge lengths to 2. In this way we obtain a rigid figure realizing the absolute lower bound 
of |v|. A calculation with explicit coordinates gives |v| > 2.75. 

Because lower bounds are generally easily determined from a series of pivots through 
arguments such as this one, we state them without proof. We state that these bounds were 
obtained by geometric considerations, to indicate that the bounds were obtained by the 
deformation arguments of this paragraph. 


4.3. Incidence Relations 


Lemma 4.15. Let v, v,, v2, v3, and v4 be distinct points in R? with pairwise distances 
at least 2. Suppose that \v; — v;| < 2to fori # j and {i, j} # {1, 4}. Then v does not lie 
in the convex hull of {v1, v2, V3, v4}. 


Proof. This lemma is proved in Lemma 3.5 of [Ha6]. 


Lemma 4.16. Let S be a simplex whose edges have length between 2 and 22. Suppose 
that v has distance at least 2 from each of the vertices of S. Then v does not lie in the 
convex hull of S. 


Proof. Assume for a contradiction that v lies in the convex hull of S. Place a unit sphere 
around v. The simplex S partitions the unit sphere into four spherical triangles, where 
each triangle is the intersection of the unit sphere with the cone over a face of S, centered 
at v. By the constraints on the lengths of edges, the arclength of each edge of the spherical 
triangle is at most 7/2. (7/2 is attained when v has distance 2 to two of the vertices, and 
these two vertices have distance 2\/2 between them.) A spherical triangle with edges of 
arclength at most 7/2 has area at most 2/2. In fact, any such spherical triangle can be 
placed inside an octant of the unit sphere, and each octant has area 2/2. This partitions 
the sphere of area 47: into four regions of area at most 2/2. This is absurd. 


Corollary 4.17. No vertex of the packing is contained in the convex hull of a quasi- 
regular tetrahedron or quarter (other than the vertices of the simplex). 


? Compare Lemma 4.21. 
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Proof. The corollary is immediate. O 


Definition 4.18. Let v,, v2, w), w2, w3 € A be distinct. We say that an edge {v), v2} 
passes through the triangle {w1, w2, w3} if the convex hull of {v;, v2} meets some point 
of the convex hull of {w), w2, w3} and if that point of intersection is not any of the 
extreme points v1, V2, Wi, W2, W3. 


Lemma 4.19. An edge of length 2to or less cannot pass through a triangle whose edges 
have lengths 2to, 2to, and V8 or less. 


Proof. The distance between each pair of vertices is at least 2. Geometric considerations 
show that the edge has length at least 


EUS(2, 2. 2, Dig Dig 4/8), 22 By Sip; 


Definition 4.20. Let (x, y, z) denote the circumradius of a triangle with edge-lengths 
x, y, and z. 


Lemma 4.21. Suppose that the circumradius of {v,, v2, v3} is less than J/2. Then an 
edge {w1, w2} C A of length at most s/8 cannot pass through the face. 


Proof. Assume for a contradiction that {w , w2} passes through the triangle {v1, v2, v3}. 
By geometric considerations, the minimal length for {w), w2} occurs when |w; —v;| = 2, 
fori = 1,2, 7 = 1, 2,3. This distance constraint places the circumscribing circle of 
{v,, V2, v3} on the sphere of radius 2 centered at w, (resp. w2). Ifr < ./2 is the circum- 
radius of {v1, v2, v3}, then for this extremal configuration we have the contradiction 


V8 > |w) — w2| = 274-72 > V8. 


Lemma 4.22. [fan edge of length at most \/8 passes through a quasi-regular triangle, 
then each of the two endpoints of the edge is at most 2.2 away from each of the vertices 
of the triangle (see Fig. 4.2). 


Proof. Let the diagonal edge be {vo, vp} and let the vertices of the face be {v1, v2, v3}. 
If |v; — vo| > 2.2 or |v; — vo| > 2.2 for some i > 0, then geometric considerations give 
the contradiction 


[uo — vu} | = E(S(2, 2, 2, 2%, 2t, 2to), 2, 2, 2.2) > VB. 


Lemma 4.23. Suppose S and S' are quasi-regular tetrahedra that share a face. Suppose 
that the edge e between the two vertices that are not shared has length at most 8. Then 
the convex hull of S and S' consists of three quarters with diagonal e. No other quarter 
overlaps S or S’. 
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Vo 


(a) (b) 


V3 V3 
vi 


, 
Wo vi 


Fig. 4.2. (a) Two quasi-regular tetrahedra that share a face. (b) The same convex body viewed as three 
quarters that share a diagonal. 


Proof. Suppose that S and S’ are adjacent quasi-regular tetrahedra with a common face 
F. By Lemma 4.22, each of the six external faces of this pair of quasi-regular tetrahedra 
has circumradius at most 7(2.2, 2.2, 2%) < V2. A diagonal of a quarter cannot pass 
through a face of this size by Lemma 4.21. This implies that no other quarter overlaps 
these quasi-regular tetrahedra. 


Lemma 4.24. Suppose an edge {w, w>} of length at most V’8 passes through the face 
formed by a diagonal {v,, v2} and one of its anchors. Then w, and wz are also anchors 


of {v1, U2}. 
Proof. This follows from the inequality 


E (S(O, 2,2, »/8, 2to, 21), 2, 2, Bio) > 4/8 


and geometric considerations. 


Definition 4.25. Let A be a saturated packing. Assume that the coordinate system is 
fixed in such a way that the origin is a vertex of the packing. The height of a vertex is its 
distance from the origin. 


Definition 4.26. We say that a vertex is enclosed over a figure if it lies in the interior 
of the cone at the origin generated by the figure. 


Definition 4.27. An adjacent pair of quarters consists of two quarters sharing a face 
along a common diagonal. The common vertex that does not lie on the diagonal is called 
the base point of the adjacent pair. (When one of the quarters comes with a marked 
distinguished vertex, we do not assume that this marked vertex coincides with the base 
point of the pair.) The other four vertices are called the corners of the configuration. 


101 


38 T. C. Hales and S. P. Ferguson 


Definition 4.28. If the two corners, v and w, that do not lie on the diagonal satisfy 
|w — v| < V8, then the base point and four corners can be considered as an adjacent 
pair in a second way, where {v, w} functions as the diagonal. In this case we say that the 
original diagonal and the diagonal {v, w} are conflicting diagonals. 


Definition 4.29. A quarter is said to be isolated if it is not part of an adjacent pair. Two 
isolated quarters that overlap are said to form an isolated pair. 


Lemma 4.30. Suppose that there exist four nonzero vertices v1, ..., v4 of height at 
most 2to (that is, |v;| < 2t9) forming a skew quadrilateral. Suppose that the diagonals 
{v1, v3} and {v2, v4} have lengths between 2to and /8. Suppose the diagonals {v;, v3} 
and {v2, v4} cross. Then the four vertices are the corners of an adjacent pair of quarters 
with base point at the origin. 


Proof. Set d, = |v; — v3| and d; = |v2 — v4|. By hypothesis, d, and d, are at most J/8. 
If |v; — v2| > 29, geometric considerations give the contradiction 
max(d1, dy) > E(S(2to, 2, 2, 2to, V8, 2t0), 2, 2,2) > V8 > max(dj, dy). 


Thus, {0, vj, v2} is a quasi-regular triangle, as are {0, v2, v3}, {0, v3, va}, and {0, va, v1} 
by symmetry. 


Lemma 4.31. Jf, under the same hypotheses as Lemma 4.30, there is a vertex w of 
height at most \/8 enclosed over the adjacent pair of quarters, then {0, v1, ..., v4, w} 
is a quartered octahedron. 


Proof. If the enclosed w lies over say {0, v1, V2, v3}, then Jw — v4|, |w — v3] < 2% 
(Lemma 4.24), where {v,, v3} is a diagonal. Similarly, the distance from w to the other 
two corners is at most 2fo. 


Lemma 4.32. Let v; and v2 be anchors of {0, w} with 2t) < |w| < /8. If an edge 


{v3, v4} passes through both faces, {0, w, v1} and {0, w, v2}, then |v3 — v4| > J8. 


Proof. Suppose the figure exists with |v3 — v4| < 8. Label vertices so v3 lies on the 
same side of the figure as v;. Contract {v3, v4} by moving v3 and v4 until {v;, vu} has 
length 2, for u = 0, w, vj-2, andi = 3,4. Pivot w away from v3 and v4 around the 
axis {v,, v2} until |w| = 8. Contract {v3, v4} again. By stretching {v;, v2}, we obtain 
a square of edge two and vertices {0, v3, w, v4}. Short calculations based on explicit 
formulas for the dihedral angle and its partial derivatives give 


dih(S(V/8, 2, y3,2, y5,2)) > 1.075, y3, ys € [2, 2%], (4.1) 
dih(S(W/8, y2, y3, 2, ¥5, Yo) > 1, 25 ¥3, Ys, Yo € [2, 2ho]. (4.2) 

Then 
x > dih(0, w, v3, vj) + dih(O, w, v1, v2) + dih(O, w, v2, v4) > 1.075 +1+ 1.075 > x. 


Therefore, the figure does not exist. 
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Lemma 4.33. Two vertices w, w' of height at most \/8 cannot be enclosed over a 
triangle {v1, V2, v3} satisfying |v, — v2| < a e, |vy — v3| < 2f9, and |v2 — v3| < 2fo. 


Proof. For acontradiction, assume the figure exists. The long edge {v1, v2} must have 
length at least 27) by Lemma 4.22. This diagonal has anchors {0, v3, w, w’}. Assume that 
the cyclic order of vertices around the line {v, v2} is 0, v3, w, w’. We see that {v,, w} is 
too short to pass through {0, v2, w’}, and w is not inside the simplex {0, v;, v2, w’}. Thus, 
the projections of the edges {v2, w} and {0, w’} to the unit sphere at v; must intersect. 
It follows that {0, w’} passes through {v1, v2, w}, or {v2, w} passes through {v1, 0, w’}. 
However, {v2, w} is too short to pass through {v;, 0, w’}. Thus, {0, w’} passes through 
both {v;, v2, w} and {v,, v2, v3}. Lemma 4.32 gives the contradiction |w’| > J/8. 


Lemma 4.34. Let v1, v2, v3 be anchors of {0, w}, where 2to < |w| < J/8, |vj — v3| < 
/8, and the edge {v,, v3} passes through the face {0, w, v2}. Then min(|v1 — v2|, |v2 — 
v3|) < 2. Furthermore, if the minimum is 2to, then |v; — v2| = |v2 — v3| = 2fp. 


Proof. Assume min > 2f9. As in the proof of Lemma 4.32, we may assume that 
(0, v1, w, v3) is asquare. We may also assume, without loss of generality, that |w —v2| = 
|v2| = 2fo. This forces |vz — v;| = 2f0, fori = 1, 3. This is rigid, and is the unique figure 
that satisfies the constraints. The lemma follows. 


4.4. Overlap of Simplices 


This section gives a proof of Theorem 4.9 (simplices in the Q-system do not over- 
lap). This is accomplished in a series of lemmas. The first of these treats quasi-regular 
tetrahedra. 


Lemma 4.35. A quasi-regular tetrahedron does not overlap any other simplex in the 
Q-system. 


Proof. Edges of quasi-regular tetrahedra are too short to pass through the face of another 
quasi-regular tetrahedron or quarter (Lemma 4.19). If a diagonal of a strict quarter passes 
through the face of a quasi-regular tetrahedron, then Lemma 4.23 shows that the strict 
quarter is one of three joined along a common diagonal. This is not one of the enumerated 
types of strict quarter in the Q-system. 


Lemma 4.36. A quarter in the Q-system that is part of a quartered octahedron does 
not overlap any other simplex in the Q-system. 


Proof. By construction, the quarters that lie along a different diagonal of the octahedron 
do not belong to the Q-system. Edges of length at most 2f) are too short to pass through 
an external face of the octahedron (Lemma 4.19). A diagonal of a strict quarter cannot 
pass through an external face either, because of Lemma 4.22. 
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Lemma 4.37. Let Q be a strict quarter that is part of an adjacent pair. Assume that 
Q is not part of a quartered octahedron. If Q belongs to the Q-system, then it does not 
overlap any other simplex in the Q-system. 


The proof of this lemma will give valuable details about how one strict quarter overlaps 
another. 


Proof. Fix the origin at the base point of an adjacent pair of quarters. We investigate the 
local geometry when another quarter overlaps one of them. (This happens, for example, 
when there is a conflicting diagonal in the sense of Definition 4.27.) 

Label the base point of the pair of quarters up, and the four corners v1, v2, v3, v4, with 
{v;, v3} the common diagonal. Assume that |v; — v3| < J/8. 

If two quarters overlap then a face on one of them overlaps a face on the other. 
By Lemmas 4.33 and 4.32, we actually have that some edge (in fact the diagonal) of 
each passes through a face of the other. This edge cannot exit through another face by 
Lemma 4.32 and it cannot end inside the simplex by Corollary 4.17. Thus, it must end 
at a vertex of the other simplex. We break the proof into cases according to which vertex 
of the simplex it terminates at. In Case | the edge has the base point as an endpoint. In 
Case 2, the edge has a corner as an endpoint. 


Case 1. The edge {0, w} passes through the triangle {v,, v2, v3}, where {0, w} is a 
diagonal of a strict quarter. Lemma 4.24 implies that v; and v3 are anchors of {0, w}. 
The only other possible anchors of {0, w} are v2 or v4, for otherwise an edge of length 
at most 2f9 passes through a face formed by {0, w} and one of its anchors. If both v2 
and v4 are anchors, then we have a quartered octahedron, which has been excluded by 
the hypotheses of the lemma. Otherwise, {0, w} has at most three anchors: vj, v3, and 
either v2 or v4. In fact, it must have exactly three anchors, for otherwise there is no 
quarter along the edge {0, w}. So there are exactly two quarters along the edge {0, w}. 
There are at least four anchors along {v1, v3}: 0, w, v2, and v4. The quarters along the 
diagonal {v;, v3} lie in the Q-system. (None of these quarters is isolated.) The other two 
quarters, along the diagonal {0, w}, are not in the Q-system. They form an adjacent pair 
of quarters (with base point v4 or v2) that has conflicting diagonals, {0, w} and {v,, v3}, 
of length at most V8. 


Case 2. {v2, v4} is a diagonal of length less than J/8 (conflicting with {v1, v3}). (Note 
that if an edge of a quarter passes through the shared face of an adjacent pair of quarters, 
then that edge must be {v2, v4}, so that Cases | and 2 are exhaustive.) The two diagonals 
{v;, v3} and {v2, v4} do not overlap. By symmetry, we may assume that {v2, v4} passes 
through the face {0, v;, v3}. Assume (for a contradiction) that both diagonals have an 
anchor other than 0 and the corners v;. Let the anchor of {v2, v4} be denoted v24 and that 
of {v;, v3} be v13. Assume the figure is not a quartered octahedron, so that vj3 ~ v24. By 
Lemma 4.19, it is impossible to draw the edges {v1, v3} and {v13, v3} between v; and 
v3. In fact, if the edges pass outside the quadrilateral {0, v2, v24, va}, one of the edges 
of length at most 2/0 (that is, {0, vo}, {v2, voa}, {vo4, v4}, or {v4, O}) violates the lemma 
applied to the face {v;, v3, vj3}. If they pass inside the quadrilateral, one of the edges 
{v1, V13}, {v13, v3} violates the lemma applied to the face {0, v2, v4} or {v24, V2, v4}. We 
conclude that at most one of the two diagonals has additional anchors. 
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If neither of the two diagonals has more than three anchors, we have nothing more 
than two overlapping adjacent pairs of quarters along conflicting diagonals. The two 
quarters along the lower edge {v2, v4} lie in the Q-system. Another way of expressing 
this “lower-edge” condition is to require that the two adjacent quarters Q, and Q; satisfy 
dih(Q;)+dih(Q2) > 2, when the dihedral angles are measured along the diagonal. The 
pair (Q', Q4) along the upper edge will have dih(Q‘) + dih(Q4) < z. 

If there is a diagonal with more than three anchors, the quarters along the diagonal 
with more than three anchors lie in the Q-system. Any additional quarters along the 
diagonal {v2, v4} belong to an adjacent pair. Any additional quarters along the diagonal 
{v,, v3} cannot intersect the adjacent pair along {v2, v4}. Thus, every quarter intersecting 
an adjacent pair also belongs to an adjacent pair. 

In both possibilities of Case 2, the two quarters left out of the Q-system correspond 
to a conflicting diagonal. 


Remark 4.38. We have seen in the proof of Lemma 4.37 that if a strict quarter Q 
overlaps a strict quarter that is part of an adjacent pair, then Q is also part of an adjacent 
pair. Thus, if an isolated strict quarter overlaps another strict quarter, then both strict 
quarters are necessarily isolated. 


Lemma 4.39. [fan isolated strict quarter Q overlaps another strict quarter, then the 
diagonal of Q has exactly three anchors. 


The proof of the lemma will give detailed information about the geometrical config- 
uration that is obtained when an isolated quarter overlaps another strict quarter. 


Proof. Assume that there are two strict quarters Q,; and Q> that overlap. Following 
Remark 4.38, assume that neither is adjacent to another quarter. Let {0, uw} and {v1, v2} 
be the diagonals of Q; and Qo. Suppose the diagonal {v;, v2} passes through a face 
{0, u, w} of Q;. By Lemma 4.24, v; and v2 are anchors of {0, uv}. Again, either the 
length of {v;, w} is at most 2% or the length of {v2, w} is at most 2%, say {w, v2} (by 
Lemma 4.34). It follows that Q; = {0, u, w, v2} and |v; — w| > 2%. (Q is not adjacent 
to another quarter.) So w is not an anchor of {v;, v2}. 

Let {v1, v2, w’} be a face of Q> with w’ ¥ 0, u. If {v1, w’, v2} does not link {0, u, w}, 
then {v,, w’} or {v2, w’} passes through the face {0,u, w}, which is impossible by 
Lemma 4.19. So {v1, v2, w’} links {0, u, w} and an edge of {0, u, w} passes through 
the face {v;, v2, w’}. It is not the edge {u, w} or {0, w}, for they are too short by 
Lemma 4.19. So {0, u} passes through {w’, v;, v2}. The only anchors of {v;, v2} (other 
than w’) are u and 0 (by Lemma 4.32). Either {u, w’} or {w’, 0} has length at most 2%) by 
Lemma 4.34, but not both, because this would create a quarter adjacent to Q2. By sym- 
metry, Q2 = {v1, v2, w’, O} and the length of {u, w’} is greater than 2f9. By symmetry, 
{0, uw} has no other anchors either. This determines the local geometry when there are two 
quarters that intersect without belonging to an adjacent pair of quarters (see Fig. 4.3). It 
follows that the two quarters form an isolated pair. 


Isolated quarters that overlap another strict quarter do not belong to the Q-system. 
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V2 


0 


Fig. 4.3. An isolated pair. The isolated pair consists of two simplices Q; = {0,u,w,v2} and Q2 = 
{0, w’, vj, v2}. The six extremal vertices form an octahedron. This is not a quartered octahedron because 
the edges {u, w’} and {w, v;} have length greater than 2f9. 


We conclude with the proof of the main theorem of the section. 


Proof of Theorem 4.9. The rules defining the Q-system specify a uniquely determined 
set of simplices. The proof that they do not overlap is established by the preceding 
series of lemmas. Lemma 4.35 shows that quasi-regular tetrahedra do not overlap other 
simplices in the Q-system. Lemma 4.36 shows that the quarters in quartered octahedra 
are well-behaved. Lemma 4.37 shows that other quarters in adjacent pairs do not overlap 
other simplices in the Q-system. Finally, we treat isolated quarters in Lemma 4.39. 
These cases cover all possibilities since every simplex in the Q-system is a quasi-regular 
tetrahedron or strict quarter, and every strict quarter is either part of an adjacent pair or 
isolated. O 


5. V-Cells 


In the proof of the Kepler conjecture we make use of two quite different structures in 
space. The first structure is the Q-system, which was defined in the previous section. 
It is inspired by the Delaunay decomposition of space and consists of a nonoverlap- 
ping collection of simplices that have their vertices at the points of A. Historically, the 
construction of the nonoverlapping simplices of the Q-system grew out of a detailed 
investigation of the Delaunay decomposition. 

The second structure is inspired by the Voronoi decomposition of space. In the Voronoi 
decomposition, the vertices of A are the centers of the cells. It is well known that 
the Voronoi decomposition and Delaunay decomposition are dual to one another. Our 
modification of Voronoi cells are called V-cells. 

In general, it is not true that a Delaunay simplex is contained in the union of the Voronoi 
cells at its four vertices. This incompatibility of structures adds a few complications to 
Rogers’s elegant proof of a sphere packing bound [Ro1]. In this section we show that V- 
cells are compatible with the Q-system in the sense that each simplex in the Q-system 
is contained in the union of the V-cells at its four vertices (Lemma 5.28). A second 
compatibility result between these two structures is proved in Lemma 5.29. 
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The purpose of this section is to define V-cells and to prove the compatibility results 
just mentioned. In the proof of the Kepler conjecture it is important to keep both structures 
(the Q-system and the V-cells) continually at hand. We frequently jump back and forth 
between these dual descriptions of space in the course of the proof. In Section 6 we 
define a geometric object (called the decomposition star) around a vertex that encodes 
both structures. The decomposition star will become our primary object of analysis. 


5.1. V-Cells 


Definition 5.1. The Voronoi cell Q(v) around a vertex v € A is the set of points closer 
to v than to any other vertex. 


Definition 5.2. We construct a set of triangles 6 in the packing. The triangles in this 
set are called barriers. A triangle {v;, v2, v3} with vertices in the packing belongs to B 
if and only if one or more of the following properties hold: 


1. The triangle is a quasi-regular. 
2. The triangle is a face of a simplex in the Q-system. 


Lemma 5.3. No two barriers overlap; that is, no two open triangular regions of B 
intersect. 


Proof. (fthere is overlap, an edge {w , w>2} of one triangle passes through the interior of 
another {v), v2, v3}. Since |w, — w2| < /8, we have that the circumradius of {v1, v2, v3} 
is at least /2 by Lemma 4.21 and that the length |w,; — w2| is greater than 21) by 
Lemma 4.19. If the edge {w,, w2} belongs to a simplex in the Q-system, the simplex 
must be a strict quarter. If {v;, v2, v3} has edge lengths at most 2%), then Lemma 4.22 
implies that |w; — v;| < 2.2 fori = 1,2 and j = 1, 2, 3. The simplices {v,, v2, v3, wy} 
and {v1, 2, V3, W2} form a pair of quasi-regular tetrahedra. We conclude that {v;, v2, v3} 
is a face of a quarter in the Q-system. Since, the simplices in the Q-system do not overlap, 
the edge {w 1, w2} does not belong to a simplex in the Q-system. The result follows. O 


Definition 5.4. We say that a point y is obstructed at x € R? if the line segment from x 
to y passes through the interior of a triangular region in B. Otherwise, y is unobstructed 
at x. The “obstruction” relation between x and y is clearly symmetric. 


For each w € A, let I, be the cube of side 4, with edges parallel to the coordinate 
axes, centered at w. Thus, 


To = {Q1, Y2, v3): [wil] S 2, 7 = 1, 2, 3}. 


I, has diameter 4./3 and I,, C B(w, 23). Let R3’ be the subset of x € IR? for which x 
is not equidistant from any two v, w € A(x, 2/3) = B(x, 2/3) A. The subset R?’ is 
dense in R*, and is obtained locally around a point x by removing finitely many planes 
(perpendicular bisectors of {v, w}, for v, w € B(x, 2/3)). For x € R?’, the vertices of 
A(x, 2V3) can be strictly ordered by their distance to x. 
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Definition 5.5. Let A be a saturated packing. We define a map g: R*’ > A such that 
the image of x lies in A(x, 23). If x € R®’, let 


A, = {w € A: x € I, and w is unobstructed at x}. 


If A, = G, then let g(x) be the vertex of A(x, 2/3) closest to x. (Since A is saturated, 
A(x, 2/3) is nonempty.) If A, is nonempty, then let g(x) be the vertex of A, closest 
to x. 


Definition 5.6. For v € A, let VC(v) be defined as the closure of g~!(v) in R?. We 
call it the V-cell at v. 


Remark 5.7. In a saturated packing, the Voronoi cell at v will be contained in a ball 
centered at v of radius 2. Hence J, contains the Voronoi cell at v. By construction, the 
V-cell at v is confined to the cube I. The cubes J, were introduced into the definition 
of g with the express purpose of forcing V-cells to be reasonably small. Had the cubes 
been omitted from the construction, we would have been drawn to frivolous questions 
such as whether the closest unobstructed vertex to some x € R* might be located in a 
remote region of the packing. 


The set of V-cells is our promised decomposition of space. 


Lemma 5.8. V-cells cover space. The interiors of distinct V -cells are disjoint. Each 
V-cell is the closure of its interior. 


Proof. The sets gy '(v), for v € A, cover R3’. Their closures cover R?. The other 
statements in the lemma follow from the fact that a V-cell is a union of finitely many 
nonoverlapping, closed, convex polyhedra. This is proved in the following lemma. 


Lemma 5.9. Each V -cell is a finite union of nonoverlapping convex polyhedra. 


Proof. During this proof, we ignore sets of measure zero in R? such as finite unions of 
planes. Thus, we present the proof as if each point belongs to exactly one Voronoi cell, 
although this fails on an inconsequential set of measure zero in R°. 

It is enough to show that if E C R? is an arbitrary unit cube, then the V-cell decom- 
position of space within E consists of finite unions of nonoverlapping convex polyhedra. 
Let X¢ be the set of w € A such that /,, meets E. Included in Xz is the set of w whose 
Voronoi cells cover E. The rules for V-cells assign x € E to the V-cell centered at some 
we Xz. 

Let d be an upper bound on the distance between a vertex in Xg¢ and a point of E. 
By the pythagorean theorem, we may take d = (1 + 2)./3. Let Bg be the set of barriers 
with a vertex at most distance d from some point in E. 

For each pair {u, v} of distinct vertices of Xz, draw the perpendicular bisecting 
plane of {u, v}. Draw the plane through each barrier in Bg. Draw the plane through 
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Fig. 5.1. A hypothetical arrangement that leads to a nonconvex V-cell at v. 


each triple {u, v, w}, where u € Xz and {v, w} are two of the vertices of a barrier 
in Bg. These finitely many planes partition EF into finitely many convex polyhedra. 
The ranking of distances from x to the points of X¢ is constant for all x in the in- 
terior of any fixed polyhedron. The set of w € X¢ that are obstructed at x is con- 
stant on the interior of any fixed polyhedron. Thus, by the rules of construction of 
V-cells, for each of these convex polyhedra, there is a V-cell that contains it. The result 
follows. O 


Remark 5.10. A number of readers of the first version of this manuscript presumed 
that V-cells were necessarily star-convex, in large part because of the inapt name “de- 
composition star” for a closely related object. The geometry of a V-cell is significantly 
more complex than that of a Voronoi cell. Nowhere do we make a general claim that 
all V-cells are convex, star-convex, or even connected. In Fig. 5.1 we depict a hypo- 
thetical case in which the V-cell at v is potentially disconnected. (This figure is merely 
hypothetical, because I have not checked whether it is possible to satisfy all the metric 
constraints needed for it to exist.) The shaded triangle represents a barrier. The point x 
is obstructed by the shaded barrier at w. If x and y lie closer to w than to v, if v is the 
closest unobstructed vertex to x, if w is the closest unobstructed vertex to y, if x, y, and 
z are all unobstructed at v, and if z lies closer to v than to w, then it follows that x and 
z lie in the V-cell at v, but that the intervening point y does not. Thus, if all of these 
conditions are satisfied, the V-cell at v is not star-shaped at v. 


Remark 5.11. Although we have not made a detailed investigation of the subtleties of 
the geometry of V-cells, we face a practical need to give explicit lower bounds on the 
volume of V-cells. Possible geometric pathologies are avoided in the proof by the use 
of truncation. (To obtain lower bounds on the volume of V-cells, parts of the V-cell can 
be discarded.) For example, Lemma 5.23 shows that inside B(v, fo), the V-cell and the 
Voronoi cell are equal. 

In general, truncation will discard points x of V-cells where A, = J. These estimates 
also discard points of the V-cell that are not part of a star-shaped subset of the V-cell 
(to be defined later). 

Truncation will be justified later in Lemma 7.18, which shows that the term in- 
volving the volume of V-cells in the scoring function o has a negative coefficient, so 
that by decreasing the volume through truncation, we obtain an upper bound on the 
function o. 
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5.2. Orientation 


We introduce the concept of the orientation of a simplex and study its basic properties. 
The orientation of a simplex will be used to establish various compatibilities between 
V-cells. 


Definition 5.12. We say that the orientation of the face of a simplex is negative if the 
plane through that face separates the circumcenter of the simplex from the vertex of the 
simplex that does not lie on the face. The orientation is positive if the circumcenter and 
the vertex lie on the same side of the plane. The orientation is zero if the circumcenter 
lies in the plane. 


Lemma 5.13. At most one face of a quarter Q has negative orientation. 


Proof. The proof applies to any simplex with nonobtuse faces. (All faces of a quarter 
are acute.) Fix an edge and project Q orthogonally to a triangle in a plane perpendicular 
to that edge. The faces F; and F2 of Q along the edge project to edges e; and e2 of the 
triangular projection of Q. The line equidistant from the three vertices of F; projects 
to a line perpendicular to e;, for i = 1,2. These two perpendiculars intersect at the 
projection of the circumcenter of Q. If the faces of Q are nonobtuse, the perpendiculars 
pass through the segments e; and e respectively; and the two faces F; and F cannot 
both be negatively oriented. 


Definition 5.14. Define the polynomial x by 


XX... 5X6) = Xp X4Xs + Xp XGX4 + XQXEX5 + XQX4Xs + X5X3XG 


+ x3X4X6 — 2X5X6X4 — Hix — Xoxe - Hene- 


In applications of x, we have x; = ae where (y1,..., Ye) are the lengths of the edges 
of a simplex. 


Lemma 5.15. A simplex S(y1, ..., yo) has negative orientation along the face indexed 
by (4, 5, 6) if and only if x(y7, ..., yz) < 0. 


Proof. (This lemma is asserted without proof in [Ha6].) Let x; = oe Represent the 
simplex as S = {0, v1, v2, v3}, where {0, v;} is the ith edge. Write n = (v1; — v3) x (v2 — 
v3), anormal to the plane {v;, v2, v3}. Let c be the circumcenter of §. We can solve for 
a unique ¢ € R such that c + tn lies in the plane {vj, v2, v3}. The sign of t gives the 
orientation of the face {v;, v2, v3}. We find by direct calculation that 


t — x (x1 PS Se x6) 
JACI, «+, X6)U(X4, X5, X6) 
where the terms A and wu in the denominator are positive whenever x; = y?, where 
(y1,---, 6) are the lengths of edges of a simplex (see Section 8.1 of [Ha6]). Thus, ¢ and 
x have the same sign. The result follows. O 
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Lemma 5.16. Let F be a set of three vertices. Assume that one edge between pairs of 
vertices has length between 2to and \/8 and that the other two edges have length at most 
2to. Let v be any vertex not on Q. If the simplex (F,, v) has negative orientation along 
F, then it is a quarter. 


Proof. The orientation of F is determined by the sign of the function x (see Lemma 5.15). 
The face F is an acute or right triangle. Note that 0x /dx; = x4(—x4 + x5 +6). By the 
law of cosines, —x4 + x5 + x6 > 0 for an acute triangle. Thus, we have monotonicity in 
the variable x;, and the same is true of x2, and x3. Also, x is quadratic with a negative 
leading coefficient in each of the variables x4, x5, x6. Thus, to check positivity, when any 
of the lengths is greater than 2f, it is enough to evaluate 


xO 2 Ax oy 2: YO ACD Xe e), (42.2 a 2), 


for x € [2, 2fo], y € [2, fo], and z € [2%, 4/8], and verify that these values are nonnega- 
tive. (The minimum, which must be attained at a corner of the domain, is 0.) O 


Lemma 5.17. Let {v1, v2, v3} be a quasi-regular triangle. Let v be any other vertex. 
If the simplex S = {v, v1, v2, v3} has negative orientation along {v,, V2, v3}, then S is a 
quasi-regular tetrahedron and |v — v;| < 2to. 


Proof. The proof is similar to the proof of Lemma 5.16. It comes down to checking 
that 


Ye 2 Ae oy eS 0: 


for x, y, z € [2, 2%]. O 


Lemma 5.18. Jfaface ofa simplex has circumradius less than /2, then the orientation 
is positive along that face. 


Proof. If the face has circumradius less than /2, by monotonicity 


Xz, «0 VO) = X44, 4, yd, VS, VS) = 2yd ye y2(2/n(y4, ys, yo)? — 1) > 0. 


(Here y; are the edge-lengths of the simplex.) O 


5.3. Interaction of V-cells with the Q-system 


We study the structure of one V-cell, which we take to be the V-cell at the origin v = 0. 
Let Q be the set of simplices in the Q-system. For v € A, let QO, be the subset of those 
with a vertex at v. 


Lemma 5.19. Jf x lies in the (open) Voronoi cell at the origin, but not in the V -cell at 


the origin, then there exists a simplex Q € Qo, such that x lies in the cone (at 0) over 
Q. Moreover, x does not lie in the interior of Q. 
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Proof. (fx lies in the open Voronoi cell at the origin, then the segment {tf x: 0 < t < 1} 
lies in the Voronoi cell as well. By the definition of the V-cell, there is a barrier {v;, v2, v3} 
that the segment passes through. If the simplex Q = {0, v;, v2, v3} were to have positive 
orientation with respect to the face {v1, v2, v3}, then the circumcenter of {0, v1, v2, v3} 
would lie on the same side of the plane {v;, v2, v3} as 0, forcing the intersection of the 
Voronoi cell with the cone over Q to lie in this same half-space. However, by assumption, 
x is a point of the Voronoi cell in the opposing half-space. Hence, the simplex Q has 
negative orientation along {vj, v2, v3}. 

By construction, the barriers are acute or right triangles. The function x (which gives 
the sign of the orientations of faces) is monotonic in x;, x2,.x3 when these come from 
simplices (see the proof of Lemma 5.16.) We consider the implications of negative orien- 
tation for each kind of barrier. If the barrier is a quasi-regular triangle, then Lemma 5.17 
gives that Q is a quasi-regular tetrahedron when x < 0. If the barrier is a face of a flat 
quarter in the Q-system, then Lemma 5.16 gives that Q is a flat-quarter in the Q-system 
as well. Hence O € Q. 

The rest is clear. 


Lemma 5.20. If x lies in the open ball of radius \/2 at the origin, and if x is not in the 
closed cone over any simplex in Qo, then the origin is unobstructed at x. 


Proof. Assume for a contradiction that the origin is obstructed by the barrier T = 
{u, v, w} at x, and {0, wu, v, w} is not in Op. We show that every point in the convex hull 
of T has distance at least /2 from the origin. Since T is a barrier, each edge {u, v} has 
length at most \/8. Moreover, the heights |u| and |v| are at least 2, so that every point 
along each edge of T has distance at least /2 from the origin. Suppose that the closest 
point to the origin in the convex hull of 7 is an interior point p. Reflect the origin through 
the plane of T to get w’. The assumptions imply that the edge {0, w’} passes through 
the barrier T and has length less than J/8. If the barrier T is a quasi-regular triangle, 
then Lemma 4.22 implies that {0, uv, v, w} is a quasi-regular tetrahedron in Qo, which 
is contrary to the hypothesis. Hence T is the face of a quarter in Q). By Lemma 4.34, 
one of the simplices {0, u,v, w} or {w’, uv, v, w} is a quarter. Since these are mirror 
images, both are quarters. Hence {0, u, v, w} is a quarter and it is in the Q-system by 
Lemma 4.10. This contradicts the hypothesis of the lemma. O 


The following corollary is a V-cell analogue of a standard fact about Voronoi cells. 


Corollary 5.21. The V-cell at the origin contains the open unit ball at the origin. 


Proof. Let x lie in the open unit ball at the origin. If it is not in the cone over any 
simplex, then the origin is unobstructed by the lemma, and the origin is the closest point 
of A. Hence x € VC(O). A point in the cone over a simplex {0, v;, v2, v3} € Qo lies 
in VC(O) if and only if it lies in the set bounded by the perpendicular bisectors of v; 
and the plane through {v,, v2, v3}. The bisectors pose no problem. It is elementary to 
check that every point of the convex hull of {v;, v2, v3} has distance at least | from 
the origin. (Apply the reflection principle as in the proof of Lemma 5.20 and invoke 
Lemma 4.19.) 


112 


A Formulation of the Kepler Conjecture 49 


Lemma 5.22. Jf x € B(v, to), then x is unobstructed at v. 


Proof. For a contradiction, supposed that the barrier T obstructs x from the v. As in 
the proof of Lemma 5.20, we find that every edge of T has distance at least /2 from the 
v. We may assume that the point of T that is closest to the origin is an interior point. Let 
w be the reflection of v through 7. By Lemma 4.19, we have |v — w| > 2%. This implies 
that every point of T has distance at least f@ from v. Thus T cannot obstruct x € B(0, fo) 
from v. O 


Lemma 5.23. Inside the ball of radius ty at the origin, the V-cell and Voronoi cell 
coincide: 


B(O, to) A VC(O) = BO, to) N QO). 


Proof. Letx € B(O, to) 1 VC(O) MN Q(v), where v 4 0. By Lemma 5.22, the origin is 
unobstructed at x. Thus, |x — v| < |x| < fo. By Lemma 5.22 again, v is unobstructed 
at x, so that x € VC(v), contrary to the assumption x € VC(0). Thus B(0, fo) N 
VC(O) C &2(0). Similarly, if x € B(O, to) MN Q(O), then x is unobstructed at the origin, 
and x € VC(0). O 


Definition 5.24. For every pair of vertices v;, v2 such that {0, v;, v2} is a quasi-regular 
triangle, draw a geodesic arc on the unit sphere with endpoints at the radial projections 
of v; and v2. These arcs break the unit sphere into regions called standard regions, as 
follows. Take the complement of the union of arcs inside the unit sphere. The closure of a 
connected component of this complement is a standard region. We say that the standard 
region is triangular if it is bounded by three geodesic arcs, and say that it is nontriangular 
otherwise. 


Lemma 5.25. Let v1, v2, v3, and v4 be distinct vertices such that |v;| < 2to fori = 
1, 2,3, 4.and |v; — v3], |v2 — v4| < 2t9. Then the edges {v1, v3} and {v2, v4} do not cross. 
In particular, the arcs of Definition 5.24 do not meet except at endpoints. 


Proof. Exchanging (1, 3) with (2, 4) if necessary, we may assume for a contradiction 
that the edge {v,, v3} passes through the face {0, v2, v4}. Geometric considerations lead 
immediately to a contradiction: 


2ty < E(2, 2, 2, 2to, 2to, 2t, 2,2, 2) < |v; — v3] < 2h. O 


Lemma 5.26. Each simplex in the Q-system with a vertex at the origin lies entirely in 
the closed cone over some standard region R. 


Proof. Assume for a contradiction that Q = {0, v1, v2, v3} with v; in the open cone 
over R; and with v2 in the open cone over R2. Then {0, v;, v2} and {0, w1, w2} (a wall 
between R, and R2) overlap; this is contrary to Lemma 5.3. O 
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Remark 5.27. The next two lemmas help to determine which V-cell a given point x 
belongs to. If x lies in the open cone over a simplex Qo in Q, then Lemma 5.28 describes 
the V-cell decomposition inside Q; beyond Q the origin is obstructed by a face of Q, 
so that such an x does not lie in the V-cell at 0. If x does not lie in the open cone over 
a simplex in Q, but lies in the open cone over a standard region R, then Lemma 5.29 
describes the V-cell. It states, in particular, that, for unobstructed x, it can be determined 
whether x belongs to the V-cell at the origin by considering only the vertices w that lie 
in the closed cone over R (the standard region containing the radial projection of x). In 
this sense the intersection of a V-cell with the open cone over R is local to the cone 
over R. 


Lemma 5.28. Jf x lies in the interior of a simplex Q € Q, and if it does not lie on the 
perpendicular bisector of any edge of Q, then it lies in the V -cell of the closest vertex 


of Q. 


Proof. The segment to any other vertex v crosses a face of the simplex. Such faces are 
barriers so that v is obstructed at x. Thus, the vertices of Q are the only vertices that are 
not obstructed at x. 


Let Bo be the set of triangles T such that at least one of the following holds: 


e T isa barrier at the origin, or 
e T = {0, v, w} consists of a diagonal of a quarter in the Q-system together with 
one of its anchors. 


Lemma 5.29 (Decoupling Lemma). Let x € Io, the cube of side 4 centered at the 
origin parallel to coordinate axes. Assume that the closed segment {x , w} intersects the 
closed two-dimensional cone with center 0 over F = {0, v1, v2}, where F € Bj. Assume 
that the origin is not obstructed at x. Assume that x is closer to the origin than to both 
v1, and v2. Then x ¢ VC(w). 


Remark 5.30. The Decoupling Lemma is a crucial result. It permits estimates of the 
scoring function in Section 7 to be made separately for each standard region. The esti- 
mates for separate standard regions are far easier to come by than estimates for the score 
of the full decomposition star. Eventually, the separate estimates for each standard will 
be reassembled with linear programming techniques in Section 23. 


Proof. (This proof is a minor adaptation of Lemma 2.2 of [Ha7].) Assume for a con- 
tradiction that x lies in VC(w). In particular, we assume that w is not obstructed at x. 
Since the origin is not obstructed at x, w must be closer to x than x is to the origin: 
x-w>w-w/2. The line segment from x to w intersects the closed cone C(F) of the 
triangle F = {0, v1, v3}. 

Consider the set X containing x and bounded by the planes H, through {0, v,, w}, 
Hy through {0, v2, w}, 3 through {0, v1, v2}, Ha = {x: x- vy = yy - vv, /2}, and Hs = 
{x! X + U2 = U2 + v2/2}. The planes Hy and Hs contain the faces of the Voronoi cell at 
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Fig. 5.2. The perpendicular bisector to {0, w} (dashed line) cannot meet the quadrant X’ (shaded). 


0 defined by the vertices v; and v2. The plane H3 contains the triangle F. The planes 
H, and Hy» bound the set containing points, such as x, that can be connected to w by a 
segment that passes through C(F). 

Let P = {x: x-w > w-w/2}. The choice of w implies that X M P is nonempty. We 
leave it as an exercise to check that X M P is bounded. If the intersection of a bounded 
polyhedron with a half-space is nonempty, then some vertex of the polyhedron lies in 
the half-space. Thus, some vertex of X lies in P. 

We claim that the vertex of X lying in P cannot lie on H. To see this, pick coordinates 
(x1, x2) on the plane A, with origin vp = O so that v; = (0, z) (with z > 0) and 
XO A, C X' := {(x1, x2): x1 = 0, x2 < z/2}. See Fig. 5.2. If the quadrant X’ meets 
P, then the point v, /2 lies in P. This is impossible, because every point between 0 and 
v; lies in the Voronoi cell at 0 or vj, and not in the Voronoi cell of w. (Recall that for 
every vertex v; on a barrier at the origin, |v;| < /8.) 

Similarly, the vertex of X in P cannot lie on H2. Thus, the vertex must be the unique 
vertex of X that is not on Hj; or Hy, namely, the point of intersection of H3, Hy, and 
Hs. This point is the circumcenter c of the face F. We conclude that the polyhedron 
Xo := XN P contains c. Since c € Xo, the simplex S = {w, v1, v2, 0} has nonpositive 
orientation along the face {0, v;, v2}. By Lemmas 5.16 and 5.17, the simplex S lies 
in Oo. 

Let c be the circumcenter of the triangle F = {0, v1, v2} and let cz be the circumcenter 
of the simplex {0, v1, v2, w}. Let C be the convex hull of {0, v,/2, v2/2, c, co}. The set 
C contains the set of points separated from w by the half-plane H3, closer to w than to 
0, and closer to 0 than both v; and v2. The point x lies in this convex hull C. Since this 
convex hull is nonempty, the simplex S has negative orientation along the face {0, v1, v2}. 

By assumption, w is not obstructed at x. Hence the segment from w to x does not pass 
through the face {0, v;, v2}. The set C’ of points y € C such that the segment from w to 
y does not pass through the face {0, v), v2} is thus nonempty. The set C’ must include 
the extreme point cz of C. This means that the plane {w, vj, v2} separates cz from the 
origin, so that the simplex S has negative orientation also along the face {w, v;, v2}. This 
contradicts Lemma 5.13. 


We draw out a simple consequence of the proof. Let F = {0, v1, v2} with edges 
of length between 2 and V8. Let S = {0, w, v1, v2}, and assume that S has negative 
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orientation along F. Let c be the circumcenter of the triangle F = {0, v1, v2} and 
let cy be the circumcenter of the simplex {0, v1, v2, w}. Let C be the convex hull of 
{0, v1 /2, v2/2, c, co}. The set C contains the set of points separated from w by the half- 
plane H3, closer to w than to 0, and closer to 0 than both v; and v2. Let x lie in this 
convex hull C. 


Lemma 5.31. Jn this context, w is obstructed at x. 


Proof. This is what the final paragraph of the previous proof proves by 
contradiction. 


6. Decomposition Stars 


This section constructs a topological space DS such that each point of DS encodes the 
geometrical data surrounding a vertex in the packing. The points in this topological 
space are called decomposition stars. A decomposition star encodes all of the local 
geometrical information that will be needed in the local analysis of a sphere packing. 
These geometrical data are sufficiently detailed that it is possible to recover the V-cell at 
v € A from the corresponding point in the topological space. It is also possible to recover 
the simplices in the Q-system that have a vertex at v € A. Thus, a decomposition star 
has a dual nature that encompasses both the Voronoi-like V-cell and the Delaunay-like 
simplices in the Q-system. By encoding both structures, the decomposition star becomes 
our primary geometric object of analysis. 

It can be helpful at times to visualize the decomposition star as a polyhedral object 
formed by the union of the simplices at v in the Q-system with the V-cell at v € A. 
Although such descriptions can be helpful to the intuition, the formal definition of a 
decomposition star is rather more combinatorial, expressed as a series of indexing sets 
that hold the data that are needed to reconstruct the geometry. The formal description 
of the decomposition star is preferred because it encodes more information than the 
polyhedral object. 

The term “decomposition star” is derived from the earlier term “Delaunay star” that 
was used in [Ha2] as the name for the union of Delaunay simplices that shared a common 
vertex. Delaunay stars are star-convex. It is perhaps unfortunate that the term “star” has 
been retained, because (the geometric realization of) a decomposition star need not be 
star convex. In fact, Remark 5.10 suggests that V-cells can be rather poorly behaved in 
this respect. 


6.1. Indexing Sets 


We are ready for the formal description of decomposition stars. 
Let wm = {0, 1, 2,...}. Pick a bijection b: w — A and use this bijection to index the 
vertices b(i) = v; € A,i = 0, 1,2.... Define the following indexing sets: 


e Let) =o. 
e Let J, be the set of unordered pairs of indices {i, j} such that |v; — v;| < 2% = 2.51. 
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Let J; be the set of unordered tuples of indices {i, 7, k, £} such that the corresponding 
simplex is a strict quarter. 

e Let J, be the set of unordered tuples {i, j,k, 2} of indices such that the simplex 
{u;, Vj, Ug, Ue} iS in the Q-system. 

Let J; be the set of unordered triples {i, 7, k} of indices such that v; is an anchor of 
a diagonal {v;, vg} of a strict quarter in the Q-system. 

Let J¢ be the set of unordered pairs {i, j} of indices such that the edge {v,, v;} has 
length in the open interval (2f0, »/8). (This set includes all such pairs, whether or 
not they are attached to the diagonal of a strict quarter.) 

e Let J, be the set of unordered triples {i, j,k} of indices such that the triangle 
{v;, vj, Ug} 1s a face of a simplex in the Q-system and such that the circumradius 
is less than J2. 

Let Jg be the set of unordered quadruples {i, j, k, 2} of indices such that the corre- 
sponding simplex {v,;, v;, vx, Ue} is a quasi-regular tetrahedron with circumradius 
less than 1.41. 


The data are highly redundant, because some of the indexing sets can be derived from 
others. However, there is no need to strive for a minimal description of the data. 
Set dy = 2/2 + 4/3. We recall that A(v, dy) = {w € A: |w — v| < do}. Let 


T’ = {i: v; € A(v, do)}. 


It is the indexing set for a neighborhood of v. 
Fix a vertex v = v, € A. Let Jj = {{a}}. Let 


T= {x el: x CT}, for 1<j <8. 


Each I is a finite set of finite subsets of w. Hence I; € P(P(@)), where P(X) is the 
powerset of any set X. 

Associate with each v € A the function f: T’ > B(O, do) given by f(i) = v; — v, 
and the tuple 


t= (h,...,) € P(P@))’. 


There is a natural action of the permutation group of w on the set of pairs (f, t), where 
a permutation acts on the domain of f and on P(P(q)) through its action on w. Let 
Lf, t] be the orbit of the pair (f, t) under this action. The orbit [f, ¢] is independent of 
the bijection b: m —> A. Thus, it is canonically attached to (v, A). 


Definition 6.1. Let DS° be the set of all pairs [ f, t] that come from some v ina saturated 
packing A. 


Put a topology on all pairs (f, t) (as we range over all saturated packings A, all 
choices of indexing b: w — A, and all v € A) by declaring (f, t) to be close to (f", t’) 
if and only if t = ¢’, domain( f) = domain(f’), and for alli € domain(f), f (i) is close 
to f’(i). That is, we take the topology to be that inherited from the standard topology 
on B(0, do) and the discrete topology on the finite indexing sets. 
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The topology on pairs (f, ¢) descends to the orbit space and gives a topology on DS°. 

There is a natural compactification of DS° obtained by replacing open conditions by 
closed conditions. That is, for instance if {i, j} is a pair in J6, we allow | f(i) — f(J)| to 
lie in the closed interval [2f9, /8]. The conditions on each of the other indexing sets J; 
are similarly relaxed so that they are closed conditions. 

Compactness comes from the compactness of the closed ball B(O, do), the closed 
conditions on indexing sets, and the finiteness of T’. 


Definition 6.2. Let DS be the compactification given above of DS°. Call it the space 
of decomposition stars. 


Definition 6.3. Let v be a vertex in a saturated packing A. We let D(v, A) denote the 
decomposition star attached to (v, A). 


Because of the discrete indexing sets, the space of decomposition stars breaks into a 
large number of connected components. On each connected component, the combina- 
torial data are constant. Motion within a fixed connected component corresponds to a 
motion of a finite set of sphere centers of the packing (in a direction that preserves all of 
the combinatorial structures). 

In a decomposition star, it is no longer possible to distinguish some quasi-regular 
tetrahedra from quarters solely on the basis of metric relations. For instance, the simplex 
with edge lengths 2, 2, 2, 2, 2, 2f9 is a quasi-regular tetrahedron and is also in the closure 
of the set of strict quarters. The indexing set J}, which is part of the data of a decompo- 
sition star, determines whether the simplex is treated as a quasi-regular tetrahedron or a 
quarter. 

Roughly speaking, two decomposition stars D(v, A) and D(v’, A’) are close if the 
translations A(v, dj) — v and A’(v’, do) — v’ have the same cardinality, and there is 
a bijection between them that respects all of the indexing sets / fi and proximity of 
vertices. 


6.2. Cells Attached to Decomposition Stars 


To each decomposition star we can associate a V-cell centered at 0 by a direct adaptation 
of Definition 5.6. 


Lemma 6.4. The V-cell at v depends on A only through A(v, do) and the indexing 
sets Ii. 
j 


Proof. We wish to decide whether a given x belongs to the V-cell at v or to another 
contender w € A. We assume that x € J,, for otherwise x cannot belong to the V-cell 
at v. Similarly, we assume x € J,,. We must determine whether v or w is obstructed at 
x. For this we must know whether barriers lie on the path between x and v (or w). Since 
|x — w| < 2V3 and |x — v| < 2V3, the point p of intersection of the barrier and the 
segment {x, v} (or {x, w}) satisfies |x — p| < 2,/3. All the vertices of the barrier then 
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have distance at most /8+2,/3 from x, and hence distance at most dy = /8+4/3 from 
v. The decomposition star D(v, A) includes all vertices in A(v, do) and the indexing 
sets of the decomposition star label all the barriers in A(v, do). Thus, the decomposition 
star at v gives all the data that are needed to determine whether x € I, belongs to the 
V-cell at v. 


Corollary 6.5. There is a V-cell VC(D) attached to each decomposition star D such 
that if D = D(v, A), then VC(D) + v is the V-cell attached to (v, A) in Definition 5.6. 


Proof. By the lemma the map from (v, A) maps through the data determining the 
decomposition star D(v, A). The definition of V-cell extends: the V-cell at 0 attached to 
[ f, t] 1s the set of points in B(O, Co) for which the origin is the unique closest unobstructed 
vertex of range(f). The barriers for the obstruction are to be reconstructed from the 
indexing data sets J; of f. 


Lemma 6.6. VC(D) is a finite union of nonoverlapping convex polyhedra. Moreover, 
D® vol(VC(D)) is continuous. 


Proof. For the proof, we ignore sets of measure zero, such as finite unions of planes. We 
may restrict our attention to a single connected component of the space of decomposition 
stars. On each connected component, the indexing set for each barrier (near the origin) 
is fixed. The indexing set for the set of vertices near the origin is fixed. For each D the 
VC-cell breaks into a finite union of convex polyhedra by Lemma 5.9. 

As the proof of that lemma shows, some faces of the polyhedra are perpendicular 
bisecting planes between two vertices near the origin. Such planes vary continuously 
on (a connected component) of DS. The other faces of polyhedra are formed by planes 
through three vertices of the packing near the origin. Such planes also vary continuously 
on DS. It follows that the volume of each convex polyhedron is a continuous function 
on DS. The sum of these volumes, giving the volume of VC(D), is also continuous. 


Lemma 6.7. Let A be a saturated packing. The Voronoi cell Q2(v) at v depends on A 
only through A(v, do). 


Proof. Let x be an extreme point of the Voronoi cell Q2(v). The vertex v is one of the 
vertices closest to x. If the distance from x to v is at least 2, then there is room to place 
another ball centered at x into the packing without overlap. Then A is not saturated. 
Thus, the distance from x to v is less than 2. The Voronoi cell lies in the ball B(v, 2). 
The Voronoi cell is bounded by the perpendicular bisectors of segments {v, w} for w € A. 
If w has distance 4 or more from v, then the bisector cannot meet the ball B(v, 2) and 
cannot bound the cell. Since 4 < do, the proof is complete. 


Corollary 6.8. The vertex v and the decomposition star D(v, A) determine the Voronoi 
cellatv. Infact, the Voronoi cell is determined by v and the first indexing set I; of D(v, A). 
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Definition 6.9. The Voronoi cell Q(D) of D € DS is the set containing the origin 
bounded by the perpendicular bisectors of {0, v;} fori € I. 


Remark 6.10. It follows from Corollary 6.8 that 
Q(D(v, A)) =v+ Q(v). 


In particular, they have the same volume. 


Remark 6.11. From a decomposition star D, we can recover the set of vertices U(D) 
of distance at most 2f9 from the origin, the set of barriers at the origin, the simplices of 
the Q-system having a vertex at the origin, the V-cell VC(D) at the origin, the Voronoi 
cell 82(D) at the origin, and so forth. In fact, the indexing sets in the definition of the 
decomposition star were chosen specifically to encode these structures. 


6.3. Colored Spaces 


In Section 3 we introduced a function o that will be formally defined in Definition 7.8. 
The details of the definition of o are not needed for the discussion that follows. The 
function o on the space DS of decomposition stars is continuous. This section gives an 
alternate description of the sense in which this function is continuous. 

We begin with an example that illustrates the basic issues. Suppose that we have a 
discontinuous piecewise linear function on the unit interval [—1, 1], as in Fig. 6.1. It is 
continuous, except at x = 0. 

We break the interval in two at x = 0, forming two compact intervals [—1, 0] and 
[0, 1]. We have continuous functions f_: [—1,0] —~ Rand f,: [0, 1], such that 


_ | f-@), x €[-1, 0], 
oe Ce otherwise. 


We have replaced the discontinuous function by a pair of continuous functions on smaller 
intervals, at the expense of duplicating the point of discontinuity x = 0. We view this 
pair of functions as a single function F on the compact topological space with two 


L~ 
e 


Fig. 6.1. A piecewise linear function. 
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components 
[-1,0] x {-} and [0,1] x {+}, 


where F(x, a) = fa(x) anda € {—, +}. 

This is the approach that we follow in general with the Kepler conjecture. The function 
o is defined by a series of case statements, and the function does not extend continuously 
across the boundary of the cases. However, in the degenerate cases that land precisely 
between two or more cases, we form multiple copies of the decomposition star for each 
case, and place each case into a separate compact domain on which the function o is 
continuous. 

This can be formalized as a colored space. A colored space is a topological space X 
together with an equivalence relation on X with the property that no point x is equivalent 
to any other point in the same connected component as x. We refer to the connected 
components as colors, and call the points of X colored points. We call the set of equiv- 
alence classes of X the underlying uncolored space of X. Two colored points are equal 
as uncolored points if they are equivalent under the equivalence relation. 

In our example, there are two colors “—” and “+.” The equivalence class of (x, a) 
is the set of pairs (x, b) with the same first coordinate. Thus, if x 4 0, the equivalence 
class contains one element (x, sign(x)), and in the boundary case x = 0 there are two 
equivalent elements (0, —) and (0, +). 

In our treatment of decomposition stars, there are various cases: whether an edge has 
length less than or greater than 2f9, less than or greater than /8, whether a face has 
circumradius less than or greater than s/2, and so forth. By duplicating the degenerate 
cases (say an edge of exact length 2/9), creating a separate connected component for 
each case, and expressing the optimization problem on a colored space, we obtain a 
continuous function 0 on a compact domain X. 

The colorings have in general been suppressed in places from the notation. To ob- 
tain consistent results, a statement about x € [2, 2%] should be interpreted as having 
an implicit condition saying that x has the coloring induced from the coloring on the 
component containing [2, 279]. A later statement about y € [2%0, /8] deals with y of 
a different color, and no relation between x and y of different colors is assumed at the 
endpoint 2%. 


7. Scoring 


This section is coauthored by Samuel P. Ferguson and Thomas C. Hales. 

In earlier sections, we describe each packing of unit balls by its set A C R? of centers 
of the packing. We showed that we may assume that our packings are saturated in the 
sense that there is no room for additional balls to be inserted into the packing without 
overlap. Lemma 3.3 shows that the Kepler conjecture follows if for each saturated packing 
A we can find a function A: A — R with two properties: the function is fec-compatible 
and it is saturated in the sense of Definition 3.2. 

The purpose of the first part of this section is to define a function A: A — R for every 
saturated packing A and to show that it is negligible. The formula defining A consists 
of a term that is a correction between the volume of the Voronoi cell (Q2(v) and that of 
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the V-cell VC(v) and a further term coming from simplices of the Q-system that have 
a vertex at v. 

A major theorem in this volume will be that this negligible function is fec-compatible. 
The proof of fec-compatibility can be expressed as a difficult nonlinear optimization 
problem over the compact topological space DS that was introduced in Section 6. In 
fact, we construct a continuous function Ao on the space DS such that for each saturated 
packing A and each v € A, the value of the function A at v is a value in the range of the 
function Ag on DS. In this way, we are able to translate the fec-compatibility of A into 
an extremal property of the function Ao on the space DS. 

The proof of fcc-compatibility is more conveniently couched as an optimization 
problem over a function that is related to the function Ag by an affine rescaling. This 
new function is called the score and is denoted o. (The exact relationship between Ag 
and o appears in Definition 7.12.) The function o is a continuous function on the space 
DS. This function is defined in the final paragraphs of this section. 


7.1. Definitions 


For every saturated packing A, and v € A, there is a canonically associated decompo- 
sition star D(v, A). The negligible function A: A — R that we define is a composite 


A= Ago D(-, A): A> DS > R, vb Div, A) Ao(D(v, A)), (7.1) 


where Ay: DS — Risas defined by (7.2) and (7.6) below. Each simplex in the Q-system 
with a vertex at v defines by translation to the origin a simplex in the Q-system with 
a vertex at 0 attached to D(v, A). Let Qo(D) be this set of translated simplices at the 
origin. This set is determined by D. 


Definition 7.1. Let Q be a quarter in Qo(D). We say that the context of Q is (p, q) if 
there are p anchors and p — q quarters along the diagonal of Q. Write c(Q, D) for the 
context of 0 € Qo(D). 


Note that g is the number of “gaps” between anchors around the diagonal. For example, 
the context of a quarter in a quartered octahedron is (4, 0). The context of a single quarter 
is (2, 1). 

The function Ag will be defined to be a continuous function on DS of the form 


Ao(D) = —vol(Q2(D)) + vol(VC(D)) + oe Ai(Q,c(Q, D), 0). (7.2) 
QeQo(D) 


Thus, the function Ag measures the difference in volume between the Voronoi cell and the 
V-cell, as well as certain contributions A; from the Q-system. The function A;(Q, c, v) 
depends on Q, its context c, and a vertex v of Q. The function A;(Q,c, v) will not 
depend on the second argument when Q is a quasi-regular tetrahedron. (The context is 
not defined for such simplices.) 


Definition 7.2. An orthosimplex consists of the convex hull of {0, vj, vj) + v2, vy + 
V2 + v3}, where v2 is a vector orthogonal to vj, and v3 is orthogonal to both v; and v2. We 
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Vato t3 


+N 


V3 


MD 
Fig. 7.1. The Rogers simplex is an orthosimplex. 


can specify an orthosimplex up to congruence by the parameters a = |v;|,b = |v; + v2|, 
and c = |v; + v2 + v3|, where a < b < c. This parametrization of the orthosimplex 
departs from the usual parametrization by the lengths |v;|, |v2|, |v3|. Fora < b < c, the 
Rogers simplex R(a, b, c) is an orthosimplex of the form 


R(a, b,c) = S(a, b,c, Jc? —b2,/c2 — a2, /b2 = a’). 
See Fig. 7.1. 


Definition 7.3. Let R be a Rogers simplex. We define the quoin of R to be the wedge- 
like solid (a quoin) situated above R. It is defined as the solid bounded by the four planes 
through the faces of R and a sphere of radius c at the origin. (See Fig. 7.2.) We let quo(R) 
be the volume of the quoin over R. If R = R(a, b,c) is a Rogers simplex, the volume 
quo(R) is given explicitly as follows: 
6quo(R) = (a+ 2c)(c — a)’ arctan(e) + a(b* — a”)e 
— 4c? arctan(e(b — a)/(b +c)), (7.3) 


where e > 0 is given by e*(b* — a?) = (c? — b’). 


0 


Fig. 7.2. The quoin above a Rogers simplex is the part of the shaded solid outside the illustrated box. It is 
bounded by the shaded planes, the plane through the front face of the box, and a sphere centered at the origin 
passing through the opposite corner of the box. 
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Let S be a simplex and let v be a vertex of that simplex. Let VC(S, v) be the subset 
of |S| consisting of points closer to v than to any other vertex of S. By Lemma 5.28, if 
S € Qo(D), then 


VC(S, 0) = VC(D) N |S. 


Under the assumption that S contains its circumcenter and that every one of its faces 
contains its circumcenter, an explicit formula for the volume vol(VC(S, v)) has been 
calculated in Section 8.6.3 of [Ha6]. This volume formula is an algebraic function of the 
edge lengths of S, and may be analytically continued to give a function of S with chosen 
vertex v: 


vol VC*"(S, v). 


Lemma 7.4. Let B(0, t) be a ball of radius t centered at the origin. Let v; and v2 be 
vertices. Assume that |v,| < 2t and |v2| < 2t. Truncate the ball by cutting away the 
caps 


cap; = {x € B(O,t): |x = vi | a |x|}. 


Assume that the circumradius of the triangle {0, v1, v2} is less thant. Then the intersection 
of the caps, cap, M Capp, is the union of four quoins. 


Proof. This is true by inspection. See Fig. 7.3. Slice the intersection cap, M cap, into 
four pieces by two perpendicular planes: the plane through {0, v;, v2}, and the plane 
perpendicular to the first and passing through 0 and the circumcenter of {0, v;, v2}. Each 
of the four pieces is a quoin. 


Definition 7.5. Let v € IR? and let X be a measurable subset of R?. Let sol(X, v) be 
the area of the radial projection of X\{0} to the unit sphere centered at the origin. We 
call this area the solid angle of X (at v). When v = 0, we write the function as sol(X). 


Let S = {uo, v1, V2, v3} be a simplex. Fix ¢ in the range fo < t < /2. Assume that t 
is at most the circumradius of S. Assume that it is at least the circumradius of each of the 


Fig. 7.3. The intersection of two caps on the unit ball can be partitioned into four quoins (shaded). 
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faces of S. Let VC, (S, vo) be the intersection of VC(S, vg) with the ball B(vo, t). Under 
the assumption that S contains its circumcenter and that every one of its faces contains 
it circumcenter, an explicit formula for the volume 


vol(VC,(S, vo)) 


is calculated by means of Lemma 7.4 through a process of inclusion and exclusion. In 
detail, start with |.$|M B(vo, t). Truncate this solid by caps: cap,, cap,, and cap; bounded 
by the sphere of radius ¢ centered at vg and the perpendicular bisectors (respectively) of 
{vo, Vi}, {Vo, vi}, and {vo, v2}. If we subtract the volume of each cap, cap,, then we must 
add back the volume of the doubly counted intersections of the caps. The intersections 
of caps are given as quoins (Lemma 7.4). This leads to the following formula. Let 
h; = |v;|/2 and n;; = 7 (0, v;, v;), and let S3 be the group of permutations of {1, 2, 3} in 


(S) Cy dih(S, v; 
vol VC4(S, 9) = SS) — yO oteap, + SD quo(ROhi. mys). (7-4) 
: IU 


3 i=l (i, J, VES 


We extend formula (7.4) by setting 
quo(R(a, b, c)) = 0, 


if the constraint a < b < c fails to hold. Similarly, set volcap; = 0 if |v;| > 2¢. With 
these conventions, formula (7.4) extends to all simplices. We write the extension of 
vol VC,(S, v) as 


vol VC} (S, v). 


Definition 7.6. Let? 


s-vor(S, v) 4(—6,,Vvol VC*"(S, v) + sol(S, v)/3), 


s-vor(S, v,t) = 4(—Socvol VC} (S, v) + sol(S, v)/3), 


and 
s-vorg(S, v) = s-vor(S, v, fo). 


When it is clear from the context that the vertex v is fixed at the origin, we drop v from 
the notation of these functions. If S = {v;, v2, v3, v4}, we define I(S) as the average 


1~4 
r(S) = s s-vor(S, v;). (7.5) 
The average I'(S) is called the compression of S. 


3 In [Hal] the volumes in this definition were volumes of Voronoi cells, and hence the notation vor for 
the function was adopted. We retain vor in the notation, although this direct connection with Voronoi cells has 
been lost. 


125 


62 T. C. Hales and S. P. Ferguson 


Definition 7.7. Let Q be a quarter. Let n*(Q) be the maximum of the circumradii of 
the two faces of Q along the diagonal of Q. 


Let Q be a simplex in the Q-system. We define an involution v — 0 on the vertices 
of Q as follows. If Q is a quarter and v is an endpoint of the diagonal, then let 0 be the 
opposite endpoint of the diagonal. In all other cases, set 6 = v. 

We are ready to complete the definition of the function A: A — R. The definition of 
A was reduced to that of Ap in (7.1). The function Ag was reduced in turn to that of A; 
in (7.2). To complete the definition, we define A}. 


Definition 7.8. Set 


\(S, Sc, 
weve aves 2 (7.6) 
3doct 4Soct 


where o is given as follows: 


1. When S is a quasi-regular tetrahedron: 
(a) If the circumradius of S is less than 1.41, set 


o(S,—,v) =T(S). 
(b) If the circumradius of S is at least 1.41, set 
o(S, —, v) = s-vor(S, v). 


2. When S is a strict quarter: 
(a) If nt(S) < 4/2: 
(i) If the context c is (2, 1) or (4, 0), set 


o(S,c,v) = T(S). 
(ii) If the context of S is anything else, set 


s-voro(S, v) — s-voro(S, 0) 


o(S,c,v) =T(S)+ 5) 


(b) If n*(S) = V2: 
(i) If the context of S is (2, 1), set 


o(S,c, v) = s-vor(S, v). 


(ii) If the context of S is (4, 0), set 


se we s-vor(S, v) ‘ s-vor(S, b) 


(iii) If the context of S is anything else, set 
s-vor(S, v) + s-vor(S, d) 
2 
s-vorg(S, v) — S-vorg(S, 0) 
a 5 : 


o(S,c,v) = 
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When the context and vertex v are given, we often write o(S) or o(S,v) for 
o(S,c, v). 

When n+ < V2, we say that the quarter has compression type. Otherwise, we say 
it has Voronoi type. To say that a quarter has compression type means that I'(S’) is one 
term of the function o(S, v). It does not mean that (S$) is equal to o(S, v). 


The definition of o on quarters can be expressed a second way in terms of a function 
wu. If S is a quarter, set 


P(S), if nt(S) < V2, 
pS, v) = (7.7) 
s-vor(S, v), otherwise. 
If S is a flat quarter, we have o (S, c, v) = w(S, v), for all contexts c. 
Suppose S is an upright quarter. Definition 7.8 can be expressed as follows: 


e Context (2, 1): set a(S, c, v) = w(S, v). 

e Context (4, 0): set a(S, c, v) = (u(S, v) + w(S, d))/2. 

e Other contexts: set o(S,c,v) = (u(S,v) + w(S,d) 4+ s-voro(S,v) — 
s-vorg(S, 0))/2. 


Lemma 7.9. Ap: DS — R is continuous. 


Proof. The continuity of D +> vol VC(D) is proved in Lemma 6.6. The continuity of 
D+ vol &2(D) is similarly proved. The terms vol VC(S, v) and sol(S, v) are continuous. 
To complete the proof we check that the function o(S,c, v) is continuous. It is not 
continuous when viewed as a function of the set of quarters, because of the various 
cases breaking at circumradius 1.41 and nt (S) = /2. However, these cutoffs have been 
inserted into the data defining a decomposition star (in the indexing sets /g and Jy). Thus, 
the different cases in the definition of o (S, c, v) land in different connected components 
of the space DS and continuity is obtained. 


We conclude this section with a result that will be of use in the next section. 


Lemma 7.10. Let S = {v 1, v2, v3, v4} be a simplex in the S-system, and let c be its 
context. Then 


4 
Y> Ai(S,¢, v;) = 0. 


i=1 


Proof. By formula (7.6), this is equivalent to 
4 


4 

Yas, C, Uj) = s-vor(S, c, v;). (7.8) 
i=l i=l 

Equation (7.8) is evident from Definition 7.8 for o. In fact, the terms of the form s-voro 

have opposing signs and cancel when we sum. The other terms are weighted averages of 

the terms s-vor(S, c, v;). Equation (7.8) is thus established because a sum is unaffected 

by taking weighted averages of its terms. oO 
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7.2. Negligibility 


Let B(x, r) be the closed ball of radius r € R centered at x. Let A(x, r) = AN B(x,r). 
Recall from Definition 3.2 that a function A: A — Ris said to be negligible if there 
is a constant C, such that for all r > 1, 


Y> AQ) < Cir. 


veA(x,r) 


Recall the function A: A — R given by (7.1). Explicitly, let 
A(v) = Ao(Dv, A)), 


where Apo in turn depends on functions A; and o, as determined by (7.2), (7.6), and 
Definition 7.8. 


Theorem 7.11. The function A of (7.1) is negligible. 


Proof. First we consider a simplification, where we replace A with A’ defined by 
A’(v, A) = —vol(Q(D(v, A))) + vol(VC(D(v, A))). 


(That is, at first we ignore the function A;.) The Voronoi cells partition R?, as do the 
V-cells. We have Q(v, A) C B(v, 2) (by saturation) and VC(v, A) C Biv, 24/3) (by 
Definition 5.5). Hence the Voronoi cells with v € A(x, r) cover B(x, r — 2). Moreover, 
the V-cells with v € A(x, r) are contained in B(x, r + 24/3). Hence 


> Av) < —vol B(x, r — 2) + vol B(x, r + 2V3) < Cir? 


veA(x,r) 


for some constant C}. 
If we do not make the simplification, we must include the sum 


Y> Ai (@, ¢, »). 


veA(x,r) QEO,(D(v, A)) 


Each quarter Q = {vj, v2, v3, v4} in the Q-system occurs in four sets Q,,(D(v;, A)). 
By Lemma 7.10 the sum cancels, except when some vertex of Q lies inside A(x, r) 
and another lies outside. Each such simplex lies inside a shell of width 2,/8 around the 
boundary. The contribution of such boundary terms is again bounded by a constant times 
r?. This completes the proof. 


7.3. Fec-Compatibility 
We have constructed a negligible function A. The rest of this volume will prove that this 


function is fec-compatible. This section translates fec-compatibility into a property that 
will be easier to prove. To begin with, we introduce a rescaled version of the function A. 
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Definition 7.12. Leto: DS — R be given by 
0 (D) = —4boc(vol Q(D) + Ag(D)) + 1677/3. 
It is called the score of the decomposition star. 
Recall from Definition 3.6 the constant pt © 0.05537. This constant is called a point. 
Lemma 7.13. Let Ag, A, and o be the functions defined by (7.1), (7.2), (7.6), and 


Definition 7.8. The following are equivalent: 


1. The minimum of the function on DS given by 
Dw vol Q(D) + Ao(D) 
is /32. 


2. The maximum of o on DS is 8 pt. 
Moreover, these statements imply 


e For every saturated packing A, the function A is fec-compatible. 
(Eventually, we prove fcc-compatibility by proving o(D) < 8 pt for all D € DS.) 


Proof. To see the equivalence of the first and second statements, use Definition 7.12, 
and the identity 


8 pt = —A8pe(/32) + 1677/3. 


(Note that this identity is parallel in form to Definition 7.12 for o.) 

For a given saturated packing A, the function A has the form A(v) = Ap(D(v, A)). 
Also, (D(v, A)) is a translate of Q(v), the Voronoi cell at v. In particular, they have 
the same volume. Thus, vol &2(v) + A(v) lies in the range of the function 


vol 2(D) + Ao(D) 


on DS. The minimum of this function is 32 by the first of the equivalent statements. 
It now follows from the definition of fec-compatibility, that A: A — R is indeed fcc- 
compatible. 


Theorem 7.14. Jf the maximum of the function o on DS is 8 pt, then for every saturated 
packing A there exists a negligible fcc-compatible function A. 


Proof. This follows immediately from Theorem 7.11 and Lemma 7.13. 


7.4. Scores of Standard Clusters 


The last section introduced a function o called the score. We show that the function o 
can be expressed as a sum over terms attached to each of the standard regions. 
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Definition 7.15. A standard cluster is a pair (R, D) where D is a decomposition star 
and R is one of its standard regions. A quad cluster is the standard cluster obtained when 
the standard region is a quadrilateral. 


We break o into a sum 
o(D) = )° on(D), (7.9) 
R 
indexed by the standard clusters (R, D). Let 
VCr(D) = VC(D) MN cone(R), 
whenever R is a measurable subset of the unit sphere. Let 
Q(R, D) = {Q € Qo(D): Q C cone(R)}. 
By Lemma 5.26, each Q is entirely contained in the cone over a single standard region. 
Definition 7.16. Let R be a measurable subset of the unit sphere. Set 
vorr(D) = 4 (—doct Vol VCr(D) + sol(R)/3) . 


Let R be a standard region. Set 


o(D) = vorr(D)— 48. D> Ai(Q,c(Q, D), 0). 


QeEQo(R,D) 
Lemma 7.17. o(D) = )0p oR(D), where the sum runs over all standard regions R. 


Proof. 


o(D) = —48o¢(vol Q(D) + Ao(D)) + 1627/3 


= —48c(vol VC(D) + D> Ai (Q, c(Q, D), 0)) + (4)(42/3) 
QeEQo(D) 


Hs (Ba vt VED) > 4u(0.6(0, D).0)+08)/) 
R QeQo(R, D) 


Also, we have 


vor(D) = > vorr(D). (7.10) 
RER(D) 


Lemma 7.18. Let R’ C R be the part of a standard region that does not lie in any cone 
over any Q € Qo(R, D). Then 


Or(D) = vorrg (D) + yi o(Q,c(Q, D), 0). 


QeQo(R,D) 
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Proof. Substitute the definition of A; (7.6) into the definition of or(D), noting that 
VC(Q, 0) = VCr”(D), where R” is the intersection of Q with the unit sphere. 


Remark 7.19. Lemma 7.18 explains why we have chosen the same symbol o for the 
functions or(D) and o(Q, c, v). We can view Lemma 7.18 as asserting a linear relation 
in the functions o: 


oR(D) = op (D) + 9) 0(Q,c, 0). 


The sum runs over Q € Qo that lie in the cone over R. 


7.5. Scores of Simplices and Cones 


Many of the functions in this paper are defined by terms involving volumes of simple 
solids. To give estimates on the functions, it is often convenient to partition the solids 
into smaller pieces and then define corresponding functions on each of the pieces. For 
this reason, we define some variants of the functions vor and o. 


Remark 7.20. We now define a few more variants of the function vor. The func- 
tion s-vor and its truncated version s-vor(-, tf) have been defined already. The function 
vorr(D) will also be given a truncated version vorr(D, t), for areal truncation parameter 
t > 0. The special case vorr(D, to) will be abbreviated voro, g(D). There will be another 
variant r-vor for Rogers simplices, and another c-vor for general sets. The general form 
of these functions is 


c-vor(X) = 4(—doct Vol(X) + sol(X)/3), 
for any subset X C R?. The differences between the different versions of vor come from 


the different choices of the set X and the way they are parametrized. 


Definition 7.21. Let R = R(a,b,c) be a Rogers simplex. Assume that the vertex 
terminating the edges of lengths a, b, and c is situated at the origin. Let 


r-vor(R) = 4(—doct Vol(R) + sol(R)/3). 
Definition 7.22. Let C(h, t) denote the compact cone of height / and circular base. 
Set 
P(A, t) = 2(2 — docith(h + t))/3. 
Then 
c-vor(C (h, t)) = 271 — h/t)g(h, t). (7.11) 
Remark 7.23. Below, we introduce variants of the function 0. We have already encoun- 


tered o in Definitions 7.8, 7.12, and 7.16. Informally, we call o (and various functions 
that are closely related to it) the score. Equation (7.11) represents the score of C(h, ft). 
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The solid angle of C(h, t) is 27 (1—h/t), so g(A, t) is the score per unit area. Also, g(t, f) 
is the score per unit area of a ball of radius t. That is, g(t, t) = 4(—doct Vol/ sol +4). 
We set 


3 
s-vor(S,t) = sol(S)p(t, t) + b>. d\(1 — hi/t)(p(hi, t) — g(t, t)) 
i=1 
h, <t 


— So Scot quo(R (Ai, (Vi. Vis Ye43)s 1). (7.12) 
(i, J, KES3 


In the definition we adopt the convention that quo(R) = 0, if R = R(a, b,c) does not 
exist (that is, if the condition 0 < a < b < c Is violated). In the second sum, $3 is the 
set of permutations on three letters. This definition is compatible with Definition 7.6. 


Similarly, we define vorp(D, t) for arbitrary standard clusters (P, D). (We shift 
notation from R to P for a standard region to avoid conflict with Rogers simplices R in 
the following definition.) Extending the notation in an obvious way, we have 


vorp(D,t) = sol(P)g(t, t) + > di — |vl/21))(e(uil/2, 1) — ett, 1) 


|v;|<2¢ 


— J 48oc quo(R). (7.13) 
R 


The first sum runs over vertices in P of height at most 2t. The second sum runs over 
Rogers simplices R(|v;|/2, n(F), t)in P, where F = {0, v1, v2} is a face of circumradius 
n(F) at most t, formed by vertices in P. The constant d; is the total dihedral angle along 
{0, v;} of the standard cluster. The truncations t = f9 = 1.255 and t = »/2 will be of 
particular importance. Set A(h) = (1 — h/to) (@(A, to) — (to, to)). 


Remark 7.24. We have introduced both untruncated and truncated versions of func- 
tions vor and o.. The truncated versions are used to give upper bounds on the untruncated 
versions. For example, in the function o (D), the V-cell contributes through its volume, 
as in Remark 7.20. The volume appears with a negative coefficient —46,.,. Thus, we 
obtain an upper bound on o(D) by discarding bits of volume from the V-cell. This 
suggests that we might try to give upper bounds on the score o(D) by truncating the 
V-cell in various ways. This is the reason for the truncated versions of these functions. 


7.6. The Example of a Dodecahedron 


Example 7.25. The following example illustrates why better bounds on the density of 
packings can be obtained with o (D) than with a naive approach based on the volume of 
Voronoi cells. By scoring quasi-regular tetrahedra with the compression function I"($) 
rather than s-vor(S), we will find that the score is lowered below 8 pt for configurations 
with many quasi-regular tetrahedra. To work one example, we assume that the decompo- 
sition star consists of twelve vertices located at distance 2 from the origin, at the vertices 
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of a regular icosahedron. The score is approximately 
201 (S(2, 2, 2, 2.10292, 2.10292, 2.10292)) + 1.8 pt < 8 pt. 
If s-vor(S) had been used, the score would violate Theorem 3.7: 
20 s-vor(S) © 13.5493 pt > 8 pt. 


(This is tied to the fact that the regular dodecahedron of inradius 1 has a smaller volume 
than the rhombic dodecahedron of inradius 1.) 
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Abstract. This paper is the third in a series of six papers devoted to the proof of the 
Kepler conjecture, which asserts that no packing of congruent balls in three dimensions has 
density greater than the face-centered cubic packing. In the previous paper in this series, 
a continuous function f on a compact space was defined, certain points in the domain 
were conjectured to give the global maxima, and the relation between this conjecture and 
the Kepler conjecture was established. This paper shows that those points are indeed local 
maxima. Various approximations to f are developed, that will be used in subsequent papers 
to bound the value of the function f. The function f can be expressed as a sum of terms, 
indexed by regions on a unit sphere. Detailed estimates of the terms corresponding to 
triangular and quadrilateral regions are developed. 


Introduction 


This paper has three objectives. The first is dealing with the two types of decomposition 
stars that attain the optimal Kepler conjecture bound. The second is obtaining general 
upper bounds on the score of decomposition star by truncation. The third is obtaining 
various upper bounds on the score associated to individual triangular and quadrilateral 
regions of a general decomposition star. 

The first section contains a proof that the decomposition stars attached to the face- 
centered cubic (fcc) and hexagonal-close packing (hcp) give local maxima to the scoring 
function on the space of all decomposition stars. The proof describes precisely determined 
neighborhoods of these critical points. These special decomposition stars are shown to 
yield the global maximum of the scoring function on these restricted neighborhoods. 

The second section gives an approximation to a decomposition star that provides 
an upper bound approximation to the scoring function o. In the simplest cases, the 
approximation to the decomposition star is obtained by truncating the decomposition 
star at distance fg = 1.255 from the origin. More generally, we define a collection of 
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simplices (that do not overlap any simplices in the Q-system), and define a somewhat 
different truncation for each type of simplex in the collection. For want of a more 
suggestive term, these simplices are said to form the S-system. 

When truncation at fy cuts too deeply, we reclaim a scrap of volume that lies outside 
the ball of radius fp but still inside the V-cell. This scrap is called a crown. These scraps 
are studied in that same section. 

In a final section we develop a series of bounds on the score function in triangular 
and quadrilateral regions, for use in later papers. 


8. Local Optimality 


The first several sections in the previous papers have established the fundamental def- 
initions and constructions of this issue. This section establishes the local optimality 
of the function a: DS — R in a neighborhood of the decomposition stars of the fcc 
and hep. 


8.1. Results 


Here is a sketch of the proof of local optimality. The fcc and hcp score precisely 8 pt. They 
also contain precisely eight tetrahedra around each vertex. In fact, the decomposition 
stars have eight quasi-regular tetrahedra and six other quad clusters. The proof shows 
that each of the eight quasi-regular tetrahedra scores at most 1 pt. Equality is obtained 
only when the tetrahedron is regular of side 2. Furthermore, the proof shows that each 
of six quad clusters have a nonpositive score. It will follows from these facts that any 
decomposition star with eight quasi-regular tetrahedra, six quad clusters, and no other 
standard clusters scores at most 8 pt. The case of equality is analyzed as well. The purpose 
of this section is to give a proof of the following theorem. 


Theorem 8.1 (Local Optimality). Let D be a contravening decomposition star. Let 
U(D) be the set of sphere packing vectors at distance at most 2t) from the origin. 
Assume that: 


1. The set U(D) has twelve elements. 

2. There is a bijection iy between U(D) and the kissing arrangement Ute. of twelve 
tangent unit balls in the fcc configuration, or a bijection with Unep the twelve 
tangent unit balls in the hcp configuration; such that for allv, w € U(D),|w—v| < 
2to if and only if |\Ww(w) — W(v)| = 2. That is, the proximity graph of U(D) is the 
same as the contact graph of Ufec or Unep. 


Then o(D) < 8 pt. Equality holds if and only if U coincides with Ufc, or Upcp up to a 
Euclidean motion. Decomposition stars D exist with U(D) = Use. and others exist with 
U(D) = Unep- 


Remark 8.2. This theorem is one of the key claims of Section 3.3. This theorem is 
phrased slightly differently from Claim 3.15 in Section 3.3. The reason for this is that 
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we have not formally introduced the plane graph G(D) of a decomposition star. (This 
happens in Section 20.2.) Once G(D) has been formally introduced, then Theorem 8.1 
can be expressed more directly, as follows. We let Gy.¢ and Ghcp be the plane graphs 
attached to the decomposition stars of vertices in the fcc and hcp, respectively. (These 
graphs are independent of the vertices selected.) 


Corollary 8.3 (Local Optimality—Second Version). Contravening decomposition 
stars exist. If D is a contravening decomposition star, and if the plane graph of D 
is isomorphic to Gfce OF Ghcp, then o(D) = 8 pt. Moreover, up to Euclidean motion, 
U(D) is the kissing arrangement of the twelve balls around a central ball in the fcc 
packing or the kissing arrangement of twelve balls in the hcp. 


The following theorem is also one of the main results of this section. It is a key part 
of the proof of local optimality. 


Theorem 8.4. A quad cluster scores at most 0, and that only for a quad cluster whose 
corners have height 2, forming a square of side 2. That is, on(D) < 0. Other standard 
clusters have strictly negative scores: or(D) < 0. 


The argument that the score of a quad cluster is nonpositive is general and can be 
used to prove that the score of any cluster attached to a non-triangular standard region 
(Definition 5.24) has nonpositive score. 


8.2. Rogers Simplices 


To prove Theorem 8.4, we chop the cluster (R, D) into small pieces and show that the 
“density” of each piece is at most 5,.. To prepare for this proof, this section describes 
various small geometric solids that have a density at most doc. The first of these is the 
Rogers simplex. 


Lemma 8.5. Let R(a,b,c) be a Rogers simplex, with 1 < a < b < c. It has a 
distinguished vertex (the terminal point of the edges of lengths a, b, and c), which we 
assume to be the origin. Let A(a, b,c) be the volume of the intersection of R(a, b, c) 
with a ball of radius | at the origin. Then the ratio 


A(a, b, c)/vol(R(a, b, c)) 


is monotonically decreasing in each variable. 


Proof. This is Rogers’s lemma, as formulated in Lemma 8.6 of [Ha6]. 


Lemma 8.6. Consider the Rogers simplex R(a, b, /2) with vertex at the origin. As- 
sume 1 <a < bandn(2,2,2)<b< J/2. Let A be the volume of the intersection of 
the simplex with a closed ball of radius | at the origin. Then 


A < Soc Vol(R(a, b, V2)). 
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Fig. 8.1. Some sets of low density. 


Equality is attained if and only if a = | and b = (2, 2, 2) or for a degenerate simplex 
of zero volume. 


Proof. This is a special case of Lemma 8.5. See the third frame of Fig. 8.1. 


Lemma 8.7. Consider the wedge of a cone 
W = Wa, zo) = (tx: O<t <1,x € P(a,z)} CR’, 
where P(a, zo) has the form 
P = {(x1, X2, x3): x3 = Zo, 2 + xf + x3 <2, 0 < x2 < oxy}, 
with Zo => 1. Let A be the volume of the intersection of the wedge with B(O, 1). Then 
A < doct VOI(W). 


Equality is attained if and only if W has zero volume. 
Proof. This is calculated in Section 4 of [Ha7]. See the second frame of Fig. 8.1. O 


Lemma 8.8. Let C be the cone at the origin over a set P, where P is measurable and 
every point of P has distance at least 1.18 from the origin. Let A be the volume of the 
intersection of C with B(O, 1). Then 

A < doct VOI(C). 


Equality is attained if and only if C has zero volume. 


Proof. The ratio A/vol(C) is at most 1/1.18° < Sot. See the first frame of 
Fig. 8.1. 
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8.3. Bounds on Simplices 


In this and future sections, we rely on some inequalities that are not proved in this paper. 
There is an archive of hundreds of inequalities that have been proved by computer. This 
full archive appears in [Ha16]. The justification of these inequalities appears in the same 
archive. (The proofs of these inequalities were executed by computer.) An explanation 
of how computers are able to prove inequalities can be found in [Hal4] and [Ha6]. 
Each inequality carries a nine-digit identifying number. To invoke an inequality, we 
State it precisely, and give its identifying number, e.g., CALC-123456789. The first of 
these appears in Lemma 8.10. Some results rely on a simple combination of inequalities, 
rather than a single inequality. To make it easier to reference a group of inequalities, the 
archive at [Ha16] gives a separate nine-digit identifier to certain groups of inequalities. 
This permits us to reference such a group by a single number. 


Definition 8.9. Recall that the constant pt, a point, is equal to 0 (S), where S is aregular 
tetrahedron with edges of length 2. We have pt = 4 arctan(./2/5) — 2/3 ~ 0.05537. 


Lemma 8.10. A quasi-regular tetrahedron S satisfies a(S) < 1 pt. Equality occurs if 
and only if the quasi-regular tetrahedron is regular of edge length 2. 


Proof. This is CALC-586468779. 


Remark 8.11. The reader who wishes to dig more deeply into this particular proof may 
do so. An early published proof of this lemma was not fully automated (see Lemma 9.1.1 
of [Ha6]). This early proof show by conventional means that o(S) < 1 pt in an explicit 
neighborhood of (2, 2, 2, 2, 2, 2). 


Lemma 8.12. A quarter in the Q-system scores at most 0. That is, o(Q) < 0. Equality 
is attained if and only if five edges have length 2 and the diagonal has length V8. 


Proof. Throughout the proof of this lemma, we refer to quarters with five edges of 
length 2 and one of length /8 as extremal quarters. We make use of the definition of o 
on quarters from Definition 7.8. The general context (that is, contexts other than (2, 1) 
and (4, 0)) of upright quarters is established by the inequalities* that hold for all upright 
quarters Q with distinguished vertex v: 


IA 


21 (Q) + s-voro(Q, v) — s-voro(Q, 0) 0, 
s-vor(Q, v) + s-vor(Q, 0) + s-vorp(Q, v) — s-vorg(Q, 0) < 0. 


Equality is attained if and only if the quarter is extremal. For the remaining quarters 
(that is, contexts (2, 1) and (4, 0)), it is enough to show that '(Q) < 0, if nt < V2 and 
s-vor(Q, v) < 0, if nt > V2. 


4 CALC-522528841 and CALC-892806084. 
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Consider the case n+ < /2. If Q is a quarter such that every face has circumradius 
at most /2, then> [(Q) < 0. Equality is attained if and only if the quarter is extremal. 
Because of this, we may assume that the circumradius of Q is greater than /2. The 
inequality n+(Q) < ./2 implies that the faces of Q along the diagonal have nonneg- 
ative orientation. The other two faces have positive orientation, by Lemma 5.17. Since 
(Definition 7.6) 


4 
47(Q) = }\s-vor(Q, vi), 
i=] 


itis enough to show that s-vor(Q) < 0. Since the orientation of every face is nonnegative 
and the circumradius is greater than J/2, s-vor(Q, J/2) is a Strict truncation of the V-cell 
in Q, so that 


s-vor(Q) < s-vor(Q, V2). 


We show the right-hand side is nonpositive. Let v be the distinguished vertex of Q. Let 
A be 7 the solid angle of Q at v. By the definition of s-vor(Q, /2), it is nonpositive if 
and only if 


A < Sect VOI(VC(Q, v) N B(v, V2)). (8.1) 


(VC(Q, 0) is defined in Section 7.1.) The intersection VC(Q, v) MN Biv, /2) consists 
of six Rogers simplices R(a, b, V2), three conic wedges (extending out to 2), and the 
intersection of B(v, 2) with a cone over v. By Lemmas 8.6-8.8, these three types of 
solids give inequalities like that of (8.1). Summing the inequalities from these lemmas, 
we get (8.1). 

Consider the case n+ > /2 ando = s-vor. If the quarter is upright, then® s-vor(Q) < 
0. The quarters achieving equality are extremal. Thus, we may assume the quarter is 
flat. If the orientation of a flat quarter is negative along the face containing the ori- 
gin and the diagonal, then’ s-vor(Q) < 0. The quarters achieving equality are ex- 
tremal. In the remaining case, the only possible face along which the orientation is 
negative is the top face. This means that the analytic continuation defining s-vor(Q) 
is the same as 


4(—doct VOI(X) + sol(X)/3), 


where X is the subset of the cone at v over Q consisting of points in that cone closer to 
v than to any other vertex of Q. The extreme point of X has distance at least /2 from v 
(since n+ and hence the circumradius of Q are at least /2). Thus, 


s-vor(Q) < s-vor(Q, /2). 


We have s-vor(Q, 2) < (as in the previous paragraph, by Lemma 8.6-8.8. If equality 
is attained, the wedges and cones must have zero volume, and each Rogers simplex must 


5 CALc-346093004. 
® CaLc-40003553. 
T CALC-5901405. 
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have the form R(1, 7(2, 2, 2), /2) (or zero volume). This happens exactly when the 
flat quarter has five edges of length 2 and a diagonal of length V/8. This completes the 
proof. 


Lemma 8.13. Let S be a simplex all of whose faces have circumradius at most J2. 
Assume that S is not a quasi-regular tetrahedron or quarter. Then s-vor(S) < 0. 


Proof. The assumptions imply that the orientation is positive along each face. Let v be 
the distinguished vertex of S. 

Assume first that there are at least two edges of length at least 2% at the origin or 
that there are two opposite edges of length at least 27). Then the circumradius b of each 
of the three faces at v is at least (2, 2%, 2) > 1.207. By the monotonicity properties 
of the circumradius of S, the simplex S has circumradius at least that of the simplex 
S(2, 2,2, 2,2, 2t9), which a calculation shows is greater than 1.3045. By definition, 
s-vor(S) < 0 if and only if 


sol(|$| A B(v, 1))/3 < doce vol(VC(S, 0)). 


This inequality breaks into six separate inequalities corresponding to the six Rogers’s 
simplices R(a, b, c) constituting VC(S, 0). Rogers’s lemma, Lemma 8.5, shows each of 
the six Rogers’s simplices has density at most that of R(1, 1.207, 1.3045), which is less 
than 5,.;. The result follows in this case. 

Now assume that there is at most one edge of length at least 2%) at the origin, and 
that there is not a pair of opposite edges of length at most 2f9. There are four cases up to 
symmetry, depending on which edges have length at least 2f9, and which have shorter 
length. Let S be a simplex such that every face has circumradius at most 2. We have® 
s-vor(S(y1, Yo,---, ¥6)) < O for (y1,..., ¥6) in any of the following four domains: 


[2t0, 812, 2t01°[2t, V81[2, 2t0], — [2t, 8112, 2t017[2t, VBP, 
[2, 2to]7[2t, V8)°[2, 2t0], [2, 2to}*[2%0, V8)°. O 


8.4. Breaking Clusters into Pieces 


As we stated above, the strategy in the proof of local optimality will be to break quad 
clusters into smaller pieces and then to show that each piece has density at most doct. 
There are several preliminary lemmas that will be used to prove that this decomposition 
into smaller pieces is well-defined. These lemmas are presented in this section. 


Lemma 8.14. Let T be a triangle whose circumradius is less than /2. Assume that 
none of its edges passes through a barrier in B. Then T does not overlap any barrier 


in B. 
8 CALC-629256313, CALC-917032944, CALC-738318844, and CALC-587618947. 
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Proof. By hypothesis no edge of T passes through an edge in the barrier. By Lemma 4.21, 
no edge of a barrier passes through 7. Hence they do not overlap. 


Lemma 8.15. Let T = {u, v, w} be a set of three vertices whose circumradius is less 
than /2. Assume that one of its edges {v, w} passes through a barrier b = {v,, v2, v3} 
in B. Then: 


e The edge {v, w} has length between 2to and V8. 
e The vertex u is a vertex of b. 
e One of the endpoints y € {v, w} is such that {y, v1, v2, v3} is a simplex in Q. 


Proof. The edge {v, w} must have length at least 2t9 by Lemma 4.19. If the edge {u, v} 
has length at least 27), it cannot pass through b because of Lemma 4.33. If it has length 
at most 2/9, it cannot pass through b because of Lemma 4.19. Hence {u, v} and similarly 
{u, w} do not pass through b. The edges of b do not pass through 7. The only remaining 
possibility is for u to be a vertex of b. 

If b is a quasi-regular triangle, Lemma 4.22 gives the result. If b is a face of a quarter 
in the Q-system, then Lemma 4.34 gives the result. 


Definition 8.16 (Law of Cosines). Consider a triangle with sides a, b, and c. The angle 
opposite the edge of length c is given as 


a ae: 
Ju(a2, b?, c?) 


with u(x, y, z) = —x* — y? — 27 + 2xy + 2yz + 2zx. 


arc(a, b,c) = arccos((a” +p = c”)/(2ab)) — - + arctan 


Lemma 8.17 (First Separation Lemma). Let v be a vertex of height at most \/8. Let v2 
and v3 be such that 


e 0, v, v2, and v3 are distinct vertices, and 


e 7(0, v2, v3) < V2. 


Then the open cone at the origin over the set B(O, /2) N Biv, /2) does not meet the 
closed cone C at the origin over the convex hull of {v2, v3}. 


Proof. Let D be the open disk spanning the circle of intersection of B(O, V2) and 
Biv, al 2) It is enough to show that this disk does not meet C. This disk is contained in 
Ba, af), and so we bound this ball away from the given cone. 

Assume for a contradiction that these two sets meet. Let v’ be the reflection of v 
through the plane P = {0, v2, v3}. 

If the closest point p in P to v lies outside C, then the edge constraints |v] < /8 
forces the closest point in C to lie along the edge {0, v2} or {0, v3}. Since |v2], |v3| < V8, 
this closest point has distance at least s/2 from v. Thus, we may assume that the closest 
point in P to v lies in C. 

Assume next that the closest point in P to v lies in the convex hull of 0, v2, and v3. We 
obtain an edge {v, v’} of length at most \/8 that passes through a triangle of circumradius 
less than /2. This contradicts Lemma 4.21. 
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Assume finally that the closest point lies in the cone over {v2, v3} but not in the 
convex hull of 0, v2, v3. By moving v toward C (preserving |v|), we may assume that 
|u—v2| = |v—v3| = 2. Stretching the edge {v2, v3}, we may assume that the circumradius 
of {0, v2, v3} is precisely 2. Since the closest point in P is not in the convex hull of 
{0, v2, v3}, we may move v2 and v3 away from v while preserving the circumradius and 
increasing the lengths |v — v2| and |v — v3|. By moving v again toward C, we may assume 
without loss of generality that |v2| = |v3| = 2 and |v2 — v3| = 4/8. We have reduced to a 
one-parameter family of arrangements, parametrized by |v|. We observe that the disk in 
the statement of the lemma is tangent to the segment {v2, v3} at its midpoint, no matter 
what the value of |v| is. Thus, in the extremal case, the open disk does not intersect the 
segment {v2, v3} or the cone C that it generates. This completes the proof. 


Lemma 8.18 (Second Separation Lemma). Let v; be a vertex of height at most 2to. Let 
v2 and v3 be such that 


e 0, vj, v2, and v3 are distinct vertices, 
e {0, v1, v2, v3} ¢ Qo, and 


e {0, v2, v3} is a barrier. 


Then the open cone at the origin over the set B(O, /2) N Biv, /2) does not meet the 
closed cone C at the origin over {v2, v3}. 


Proof. Since v; has height at most 2f9, and {0, v2, v3} is a barrier, it follows from 
Lemma 4.10 that {0, v;, v2, v3} is in the Q-system if |v; — v2| < 2f9 and |v; — v3] < 26. 
This is contrary to the hypothesis. Thus, we may assume without loss of generality that 
|v} = v2| > 2to. 

By arguing as in the proof of Lemma 8.17, we may assume that the orthogonal 
projection of v; to the plane P is a point in the cone C. Let vj be the reflection of 
v, through C. We have that either {v2, v3} passes through {0, v1, v}} or {v1, vj} passes 
through {0, v2, v3}. We may assume for a contradiction that |v; — v}| < /8. 

If {v2, v3} passes through {0, v1, vi} }, then v2 and v3 are anchors of the diagonal {v, vi} 
by Lemma 4.24. This gives the contradiction |v; — v2| < 2%. 

If {vj, v|} passes through {0, v2, v3}, then by Lemma 4.22 {0, v2, v3} is a face of a 
quarter. Moreover, v; and v; are anchors of the diagonal of that quarter by Lemma 4.24. 
Since |v; — v2| > 2%, the diagonal must not have v2 as an endpoint, so that the diagonal 
is {0, v3}. Lemma 4.34 forces one of |v; — v2| or |v; — v2| to be at most 29. However, 
these are both equal to |vj — v2| > 2f0, a contradiction. 


Definition 8.19. We define an enlarged set of simplices Qj. Let Q5 be the set of 
simplices S with a vertex at the origin such that either § € Qo, or S is a simplex with 
a vertex at the origin and with circumradius less than /2 such that none of its edges 
passes through a barrier. 


Lemma 8.20. The simplices in Qo do not overlap one another. 
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Proof. The simplices in Qo are in the Q-system and do not overlap. No edge of length 
less than \/8 passes through any edge of a simplex in Q) \ Qo, by Lemma 4.21. By 
construction, none of the edges of a simplex in Qj \ Qo can pass through a barrier, and 
this includes all the faces of Qo. Thus, there is no overlap. 


Definition 8.21. Let v be a vertex of height at most 2.36 = 2(1.18). Let C(v) be the 
cone at the origin generated by the intersection B(v, V2) N BO, V2). Define a subset 
C’(v) of C(v) by the conditions: 


1. x € C(v). 

2. x is closer to 0 than to v. 

3. x € BO, /2). 

4. x does not lie in the cone over any simplex in Qo. 

5. Forevery vertex u 4 0, v such that the face {0, wu, v} is a barrier or has circumradius 
less than /2 and such that none of the edges of this face pass through a barrier, we 
have that x and v lie in the same half-space bounded by the plane perpendicular 
to {0, wu, v} and passing through 0 and the circumcenter of {0, u, v}. 

6. For every simplex {0, vj, v2, v} € Qo, the segment {x, v} does not cross through 
the cone C({0, v1, v2}). 


Lemma 8.22. For every vertex v of height at most 2.36, we have C'(v) C VC(0). 


Proof. Assume for a contradiction that x € C’(v) N VC(u), with u 4 0. Lemma 5.20 
implies that x is unobstructed at 0. Thus |x — u| < |x| < af D. 

Assume that the hypotheses of Condition 5 in Definition 8.21 are satisfied. This, 
together with x € C(v) implies that n({0, u, v}) < /2. An element x that is closer to 0 
than to v and in the same half-space as v (in the half-space bounded by the perpendicular 
plane to {0, u, v} through O and the circumcenter of {0, u, v}) is closer to 0 than to u, 
which is contrary to x € VC(u). This completes the proof, except in the case that an 
edge of the triangle {0, u, v} passes through a barrier b. Assume that this is so. 

The edge {0, v} cannot pass through a barrier because it is too short (length less than 
2to). 

Suppose that the edge {u, v} passes through a barrier b. By Lemma 8.15 applied to 
T = {0, uw, v}, the origin is a vertex of b. There are three possibilities: 


1. x is obstructed from u by b. 
2. x is obstructed from v by b. 
3. x is not obstructed from either u or v by b. 


The first possibility runs contrary to the hypothesis x € VC(u). The second possibility, 
together with Lemma 8.18, implies that {v, b} is a simplex in the Q-system. This is 
contrary to Condition 6 defining C’(v). 

The third possibility is eliminated as follows. Every point in the half-space containing 
v and bounded by the plane of b 


e is obstructed at u by b, or 
e has distance at least /2 from u (because each edge of b has this property). 
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Since x has neither of these properties, we find that x must lie in the same half-space 
bounded by the plane of b as u. Let S be the simplex formed by b and v. If S ¢ Qo, then 
Lemma 8.18 shows that no part of the cone C(v) lies in the same half-space as u. So S € 
Qo. By Condition 6 on C’(v), the line from x to v does not intersect the cone at the origin 
over b. However, then the arc-length of the geodesic on the unit sphere running from the 
projection of x to the projection of v is at least arc(|v|, V8, 2) > are(|v|, /2, V2). This 
measurement shows that x lies outside the cone C(v), which is contrary to assumption. 

Suppose that the edge {0, uw} passes through the barrier b. By Lemma 8.15 applied to 
T = {0, u, v}, we get that v is a vertex of b. There are again three possibilities: 


1. x is obstructed from u by b. 
2. x is not obstructed from either u or 0 by b. 
3. x is obstructed from 0 by b. 


The first possibility runs contrary to the hypothesis x € VC(u). The second places x 
outside the convex hull of 0, b, u and gives |x — u| + |x| > 8, which is contrary to 
|x — u| < |x| < V2. The third possibility cannot occur by the observation made at the 
beginning of the proof that x is unobstructed at 0. O 


It follows from the definition that C’(v) is star convex at the origin. We make this 
more explicit in the following lemma. 


Lemma 8.23. Assume |v| < 2.36. Let F(v) be the intersection of Q(0) N Q(v); that 
is, the face of the Voronoi cell of Q(0) associated with the vertex v. Let F'(v) be the part 
of F(v) O B(O, 1.18) that is not in the cone over any simplex in Qo. Let H(v) be the 
closure of the union of segments from the origin to points of F'(v). Let C"(v) be the cone 
at the origin spanned by B(0, 1.18) N B(v, 1.18). Then the closure of C'(v) 1 C"(v) is 
equal to H(v). 


Proof. We have F'(v) C C”(v). 

First we show that F’(v) lies in the closure of C’(v). For this, we check that points 
of F’(v) satisfy the (closed counterparts of) Conditions 1-6 defining C’(v) (see Defi- 
nition 8.21). Conditions 1-4 are immediate from the definitions. If u is a vertex as in 
Condition 5, then the half-space it determines is that containing the origin and the edge 
of the Voronoi cell determined by u and v. Condition 5 now follows. Consider Condi- 
tion 6. Suppose that {x, v} crosses the cone {0, v;, v2} and that x € F’(v). (The point 
of intersection has height at most /2 and hence lies in the convex hull of {0, v1, v2}.) 
This implies that x is obstructed at v. By Lemma 5.22, this implies that |x — v| > t. 
Since x is equidistant from v and the origin, we find that |x| > f, which is contrary to 
x € BO, 1.18). 

To finish the proof, we show that C’(v) NC" (v) C H(v). For acontradiction, consider 
a point x € C’(v)C"(v) that is not in H(v). It must lie in the cone over some other face 
of the Voronoi cell; say that of u. The constraints force the circumradius of T = {0, v, u} 
to be at most 1.18. The edges of T are too short to pass through a barrier. Thus, Condition 
5 defining C’(v) places a bounding plane that is perpendicular to T and that runs through 
the origin and the circumcenter of T. This prevents x from lying in the cone over the 
face of the Voronoi cell attached to u. 
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Remark 8.24. In the lemma, it is enough to consider simplices along {0, w}, because 


arc(|v], V8, 2) > arc(|v], 1.18, 1.18). 


Corollary 8.25. [fx € VC(0), with O < |x| < 1.18, if the point at distance 1.18 from 
0 along the ray (0, x) does not lie in VC(O), and if x is not in the cone over any simplex 
of Qo, then there is some v such that x € C'(v), and |v| < 2.36. 


Proof. fx € VC(O) NM BO, 1.18), then x € 2(0)N B(O, 1.18) by Lemma 5.23. Also, 
x lies in the cone over some face F'(v) of the Voronoi cell (2(0). The hypotheses imply 
that x lies in the cone over F’(v). Lemma 8.23 implies that x € C’(v). 


Lemma 8.26. Assume that \u| < 2.36 and that \v| < 2.36. The sets C’(u), C’(v) do 
not overlap for u # v. 


Proof. Tf there is some x in the overlap, then the circumradius of {0, uw, v} is less than 
/2. If no edge of {0, u, v} passes through a barrier, then the defining conditions of C’() 
and C’(v) separate them along the plane perpendicular to {0, wu, v} and passing through 
the origin and the circumcenter of {0, u, v}. 

If some edge of {0, vu, v} passes through a barrier, then an argument like that in the 
proof of Lemma 8.22 shows they do not overlap. In fact, the edges {0, uw} and {0, v} are 
too short to pass through a barrier. Suppose the edge {u, v} passes through a barrier b. 
By Lemma 8.15 applied to T = {0, uw, v}, the origin is a vertex of b. If neither of the 
simplices {u, b} and {v, b} are in Qo, then the plane through b separates C’(u) from 
C’(v). Assume without loss of generality that S = {v, b} € Qo. By Condition 6 of the 
definition of C’ (Definition 8.21), the segment from x to v does not intersect the cone 
at the origin formed by b. As in the proof of Lemma 8.22, x lies outside the cone C(v); 
unless x and v lie in the same half-space formed by the plane of b. The cone C(u) 
intersects this half-space at x. By Lemma 8.18, we have {u, b} € Qo. Condition 6 in the 
definition of C’ now keeps x at distance at least ./2 from u. This completes the proof. 0 


Lemma 8.27. Let S be a simplex whose circumradius is less than 2. If five of the six 
edges of the simplex do not pass through a barrier, then the sixth edge e does not pass 
through a barrier either, unless both endpoints of the edge opposite e in S are vertices 
of the barrier. 


Proof. We leave this as an exercise. The point is that it is impossible to draw the 
barrier without having one of its edges pass through a face of S, which is ruled out by 
Lemma 4.21. 


8.5. Proofs 


We are finally prepared to give a proof of Theorem 8.4. We break the proof into two 
lemmas. 
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Lemma 8.28. Jf R is a standard region that is not a triangle, then or(D) < 0. 


Proof. This proof is an adaptation of the main result in [Ha7, Theorem 4.1]. We consider 
the V-cell at a vertex, which we take to be the origin. We will partition the V-cell into 
pieces. On each piece it will be shown that o is nonpositive. 

Throughout the proof we make use of the correspondence between or(D) < O and the 
bound of 6,.¢ on densities, on standard regions R (away from simplices in the Q-system). 
This correspondence is evident from Lemma 7.18, which gives the formula 


ORr(D) = 4 (—Soct vol VCz(D) + sol(R’)/3) + > o(Q,c(Q, D), 0). 
QeQo(R, D) 


If o(Q,c(Q, D),0) < 0, and vol VCr (D) 4 0 then or(D) < 0 follows from the 
inequality 


(sol(R’)/3)/vol VCx(D) < Soct- 


This is an assertion about the ratio of two volumes, that is, a bound 69,¢ on the density 
of VCR (D). 

The parts of VC(D) that lie in the cone over some simplex in Qp are easily treated. If 
S is in Qo, then it is either a quasi-regular tetrahedron or a quarter. If it is a quasi-regular 
tetrahedron, it is excluded by the hypothesis of the lemma. If it is a quarter, o(S) < 0 by 
Lemma 8.12. The parts of VC(D) that lie in the cone over some simplex in Q5\ Qo are 
also easily treated. The simplex S = {0, v, w, w’} has circumradius less than J/2. Use 
s-vor(S) on the simplex. Lemma 8.13 shows that s-vor(S) < 0 as desired. 

Next we consider the parts of VC(D) that are not in any C’(v) (with |v| < 2.36) and 
that are not in any cone over a simplex in Q). (Note that by Lemmas 8.17 and 8.18, if a 
cone over some simplex in Qj meets C’(v), then v must be a vertex of that simplex.) By 
Corollary 8.25, if x belongs to this set, then all the points out to radius 1.18 in the same 
direction belong to this set. By Lemma 8.8, the density of such parts is less than doct. 

Finally, we treat the parts of VC(D) that are in some C’(v) but that lie outside all 
cones over simplices in Q). 

Fix v of height at most 2.36. Let w,, w2,..., wz be the vertices w near {0, v} such 
that either {0, v, w} is a barrier or it has circumradius less than J/2, and such that none 
of its edges passes through a barrier. We view the triangles {0, v, w;} as a fan of triangles 
around the edge {0, v}. We assume that the vertices are indexed so that consecutive 
triangles in this fan have consecutive indices (modulo k). We will analyze the densities 
separately within each wedge, where a wedge is the intersection along the line {0, v} of 
half-spaces bounded by the half-planes {0, v, w;} and {0, v, w;,,}. Space is partitioned 
by these k different wedges. Fix i and write w = w;, w’ = w;41. Let S = {0, v, w, w’}. 

Let F be the convex planar region in the perpendicular bisector of {0, v} defined by 
the points inside the closure of C’(v), inside the wedge between {0, v, w} and {0, uv, w’}, 
closer to v than to w, and closer to v than to w’. This planar region is illustrated in Fig. 8.2. 
The edge e lies in the line perpendicular to {0, v, w} and through the circumcenter of 
{0, v, w}. It extends from the circumcenter out to distance ./2 from the vertices 0, v, w. 
If the circumradius of {0, v, w} is greater than /2, the edge e reduces to a point, and 
only the arc a at distance /2 from 0 and v appears. Similar comments apply to e’. 
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Fig. 8.2. A planar region. 


Case 1: Circumradius of S Is Less than /2. We show that this case does not occur. If 
none of the edges of this simplex pass through a barrier, then this simplex belongs to Q), 
a case already considered. By definition of the wedges, the edges {0, v}, {0, w}, {0, w’}, 
{v, w}, and {v, w’} do not pass through a barrier. Since five of the six edges do not pass 
through a barrier, and since S is formed by consecutive triangles in the fan around {0, v}, 
the sixth does not pass through a barrier either, by Lemma 8.27. 


Case 2: Circumradius of S Is at Least V2. Letr > 2 be the circumradius. We claim that 
the edge e cannot extend beyond the wedge through the half-plane through {0, v, w’}. 
In fact, the circumcenter of {0, v, w, w’} lies on the extension (in one direction or the 
other) of the segment e to a point at distance r from the origin. If this circumcenter does 
not lie in the wedge, then the orientation is negative along one of the faces {0, v, w} 
or {0, v, w’}. This face must have circumradius at least V2, by Lemma 5.18, and this 
forces the face to be a barrier. If the orientation is negative along a barrier, then the 
simplex {0, v, w, w’} is a simplex in OQ) (Lemmas 5.16 and 5.17). This is contrary to our 
assumption above that {0, v, w, w’} is not in Qo. 

These comments show that Fig. 8.2 correctly represents the basic shape of F’, with 
the understanding that the edges e and e’ may degenerate to a point. By construction, 
every point x in the open convex hull {F, 0} of F and 0 lies in C’(v) C VC(0). The 
convex hull { F’, 0} is the union of three solids, two Rogers simplices along the triangles 
{0, v, w} and {0, v, w’} respectively, and the conic solid given by the convex hull of the 
arc a, v/2 and 0. By Lemmas 8.6 and 8.7, these solids have density at most doct. 

This completes the proof that or(D) is never positive on nontriangular standard 
regions R. Note that the decomposition into the parts of cones C’(v) inside a wedge is 
compatible with the partition of the unit sphere into standard regions, so that the estimate 
holds over each standard region, and not just over the union of the standard regions. O 


Lemma 8.29. Jf R is a standard region that is not a triangle, and if op(D) = 0, then 
(R, D) is a quad cluster. Moreover, the four corners of R in the quad cluster have height 
2, forming a square of side 2. 


Proof. To analyze the case of equality, first we note that any truncation at 1.18 produces 
a strict inequality (Lemma 8.8 is strict if the volume is nonzero), so that every point must 
lie over a simplex in Qj or over some C’(v). We have s-vor(S) < 0 for simplices with 
circumradius less than /2. The only simplices in Qo that produce equality are those 
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with five edges of length 2 and a diagonal of length 8. Any nontrivial arc a produces 
strict inequality (see Lemma 8.7), so we must have that e and e’ meet at exactly distance 
/2 from 0 and v. Moreover, if e does not degenerate to a point, the corresponding 
Rogers simplex gives strict inequality, unless {0, v, w} is an equilateral triangle with 
side length 2. We conclude that the entire part of the V-cell over the standard region 
must be assembled from Rogers simplices R(1, 7(2, 2, 2), Dy and quarters with lengths 
(2,.2:2,.2.2, /8). This forces each vertex v of height at most 2%) to have height 2. It 
forces each pair of triangles {0, v1, v2} {0, v2, v3}, that determine consecutive edges along 
the boundary of the standard region to meet at right angles: 


dih(O, v2, vj, v3) = 0. 


This forces the object to be a quad cluster of the indicated form. 


We conclude the section with a proof of the main theorem. With all our preparations 
in place, the proof is short. 


Proof of Theorem 8.1 (Local Optimality). The hypothesis implies that there are six 
quad clusters and eight quasi-regular tetrahedra at the origin of the decomposition star. 
By Lemma 8.10, each quasi-regular tetrahedron scores at most | pt with equality if and 
only if the tetrahedron is regular with edge-length 2. By Theorem 8.4, each quad cluster 
scores at most 0, with equality if and only if the corners of the quad cluster form a square 
with edge-length 2 at distance 2 from the origin. Thus, o (D) is at most 8 pt. In the case of 
equality, there are twelve vertices at distance 2 from the origin, forming eight equilateral 
triangles and six squares (all of edge-length 2). These conditions are satisfied precisely 
when the arrangement is Us.¢ or Upcp up to a Euclidean motion. 


9. The S-System 
9.1. Overview 


In this section we define a decomposition of a V-cell. Let VC be the V -cell at the origin. 
For any t > 0, let V(t) be the intersection of VC with the ball B(O, t) at the origin of 
radius tf. We write VC as the disjoint union of V (fo) and its complement 6. 

Assume that there is an upright quarter in the Q-system with diagonal {0, v}. As 
usual, we call {0, v} an upright diagonal. We will define 6(v) C 6. It will be a subset of 
a set of the form C(D,) 6 for some subset D, of the unit sphere. The sets D, will be 
defined so as not to overlap one another for distinct v. Then the sets 6(v) do not overlap 
one another either. We will give an explicit formula for the volume of 5(v). 

We will define a set S of simplices, each having a vertex at the origin. (The letter “S” 
is for simplex.) The vertices of the simplices will be vertices of the packing, and their 
edges will have length at most 2/2. The sets C(S), for distinct § € S, will not overlap. 
Over a simplex S € S, the V-cell will be truncated at a radius ts > to. After defining the 
constants ts, we will set 


Vs(ts) = C(S)N Vits) = C(S) N B(ts) A VC(O). 
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That is, Vs (ts) is the part of the V-cell at the origin, contained in the cone over S and in 
the ball of radius ts. If VC(0) A C(S) C B(ts) C Bits), then Vs(ts) = Vs(t¢). 

Since ts > fo, the sets Vs(ts) and 6 may overlap. Nevertheless, we will show that 
Vs(ts) does not overlap any 6(v). Let V5 (to) be the set of points in V (fo) that do not lie 
in C(S), S € S. We will derive an explicit formula for the volume of VS (to). 

In VC(O), there are nonoverlapping sets 


5(v), Vs(ts), V°(to). 


Let 5’ be the complement in VC(0) of the union of these sets. These sets give a decom- 
position of VC(0). Corresponding to this decomposition is a formula for o(D) of the 
form 


o(D) = c-vor(V*(t)) + > c-vor(Vs(ts)) — > AS8oct VOL(S(V)) — 45ocrVol(5’). 
S v 


Since vol(6’) > 0, we obtain an upper bound on o (D) by dropping the rightmost term. 


9.2. The Set 5(v) 


Let {0, v} be the diagonal of an upright quarter in Q9. We define 6(v) C C(D,) 164 for 
an appropriate subset D, of the unit sphere. 


Definition 9.1. Set no(h) = n(2h, 2, 2t0). 


Ifh < V2, then no(h) < no(V2) < 1.453. 

Let Do be the spherical cap on the unit sphere, centered along {0, v} and having 
arcradius 0, where cos @ = |v|/(2no(\v|/2)). 

The area of Do is 27 (1 — cos @). Let vj, ..., vg be the anchors around {0, v} indexed 
cyclically. The radial projections of the edges {v, u;} (extended as necessary) slice the 
spherical cap into k wedges W;, between {v, u;} and {v, vj}, where j = i + 1 modk, so 
that Do = U W;. 


Definition 9.2. Let W be the set of wedges W = W; such that either 


1. W occupies more than half the spherical cap (so that its area is at least 2 (1—cos @)), 
or 

2. |v; — v;| = 2.77, rad(O, v, v;, vj) = yo(lv|/2), and the circumradius of {0, v;, v;} 
or {v, u;, vj} is = afd. 


Fix i, j, with 7 = i+ 1modk. If W = W; is a wedge in W, let {0, v;, v}t be the 
plane through the origin and the circumcenter of {0, v;, v}, perpendicular to {0, v;, v}. 
Skip the following step if the circumradius of {0, v;, v} is greater than no(|v|/2), but if 
the circumradius is at most this bound, let c; be the intersection of {0, v;, v}+ with the 
circular boundary of W. Extend W by adding to W the spherical triangle with vertices the 
radial projections of v, v;, and c;. Similarly, extend W with the triangle from {v, v;, c;}, 
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Fig. 9.1. An example of a set W° (shaded region). 


if the circumradius of {0, v;, v} permits. (An example of this is illustrated in Fig. 9.1.) 
Let W° be the extension of the wedge obtained by adding these two spherical triangles. 

We will define 5(v, W°) C C(W*) 16. Then 6(v) is defined as the union of 6(v, W°), 
for W ¢ W. Let 


Ey ={x:2x-w<w-wh}, 


for w = v, u;, v;. These are half-spaces bounding the Voronoi cell. Set Ez; = E,,. 


If Condition 2 holds, we let c be the radial projection of the circumradius of {0, v;, v;, v} 
to the unit sphere. The arclength from c to the radial projection of v is 6’, where 
cos 6’ = |v|/(2rad) < |v|/(2n9) = cos. 


We conclude that 0’ > @ and c does not lie in Do. 


Definition 9.3. In both Conditions 1 and 2, set 
A(v, W°) = (E, NE; Ej OC(W*)), 
d(v, W°) = Av, W°)\B(to). 
Remark 9.4. There are some degenerate cases in this construction depending on the 


number of anchors. If there is no anchor, then A(v, W°) is to be defined simply as 
(E, 1 C(W*)). If there is one anchor v,, then 


A(u, W°) = (Ey NE; NA C(W*)). 
Remark 9.5. The following remark applies when the points c; and c; have been con- 


structed, and is irrelevant when that step was skipped in the construction described above. 
Observe that 


E, NE; NE; 1 C(W*) 
is the union of four Rogers simplices 
R(\w|/2, nO, v, ve), no(lv|/2)), w=v,y, €=i,j, 


and a conic wedge over W between c; and c;. (The inequality 6’ > 6 implies that the 
Rogers simplices do not overlap.) 
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In general, we break A(v, W°) into an inner part A~ (uv, W°) (the part outside the 
Rogers simplices together with (as many as) two Rogers simplices along (0, v)), and the 
Rogers simplices R,,, for w = v;, v;. We take R,, to be the empty set, when there is no 
anchor w with n(0, v, w) < no(|v|/2). 

We present a series of lemmas that explore the geometry of the sets A(v, W°). In the 
next few lemmas we make use of a function ¢, which is defined as follows. 


Definition 9.6. If A is a set of vertices containing v, let e,(A,x) € A be given as 
the vertex w € A\{v} such that the ray from v through x intersects the perpendicular 
bisecting plane of {v, w} before that of any other w’ € A\{v}. If the ray from v through 
x does not intersect any of the planes, then we set ¢ to the default value v. In cases of 
ties, resolve the tie in any consistent manner. If x € (2(0) (the Voronoi cell at the origin), 
then x lies in the cone over the face attached to the vertex eg(A, x) € A. 

We define a function ¢’ in a similar fashion. Assume €,(A, x) = w, where the ray 
from v to x intersects the perpendicular bisector to {v, w} at x’. Set 


a (A, &) =eyp(A\wh 2). 


That is, move along the face of the Voronoi cell from x until another face is encountered. 
Let the corresponding vertex be the value of e’. If x € (0) in the cone over the face 
attached to the vertex w, and if w/2 lies on that face, then x’ lies in the sector of the face 
formed by the cone at w/2 generated by the edge of the Voronoi cell between the faces 
associated to w and €4(A, x). 


Lemma 9.7. Let S = {0,v, w, u} be a simplex. Assume that {0, v} is an upright di- 
agonal of a quarter in the Q-system, that w and v are anchors of {0, v}, and that 
rad(S) < no(|v|/2). Assume there is a wedge W of W along the face {0, v, w} (on the 
same side of the face as u). Let R,, be the Rogers simplex R(\w|/2, n(0, v, w), no(\v|/2)) 
along the face {0, w, v} along the edge of {0, w} on the same side of the face as u. Then: 


1. There exists an anchor w' between u and w with |w — w'| < 2to, and |w' — w| => 
2.77. 

2. {0, v, u, w’} is an upright quarter in the Q-system and its face {0,v, w'} is a 
barrier. 

3. The barrier {0, v, w’} obstructs every point of Ry from u. 


Proof. Since rad(S) < no(|v|/2) is contrary to the conditions defining wedges, the 
wedge must run from the face {0, v, w} to a face {0, v, w’}, where w’ is an anchor 
between w and wu. By the hypotheses defining wedges W € W, we have that the length 
of {u, w’} is at least 2.77. For the same reason, the circumradius of {0, v, w, w’} is at 
least no(|v|/2). 

We claim that R,, lies in the convex hull of S = {0, v, w, w’}. Since |w — w’| > 2.77, 
we see that the orientation of each face of {0, v, w, w’} is positive. Since rad(S) > no, 
we have 


R, C R(\w|/2, nO, v, w), rad(S)) CS. 


Thus it is enough to show that each point of the convex hull of S' is obstructed. 
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For this, it is enough to show that the extreme point w is obstructed from u by the 
barrier {0, v, w’}. In other words, we show that the edge {w, v} passes through {0, v, w’}. 
For this we show that no other geometrical configuration of points is possible. 

w’ is not in the convex hull of {0, v, w, u} by Lemma 4.15. The vertex w’ is not 
enclosed over {0, v, w, u} because 


EUS (22 0tac Mg. 8p, 20 A53)), 2. 2 OS DK, 


(The constant 1.453 is from Definition 9.1.) The edge {v, w’} does not pass through 
{0, w, u}, for otherwise we would reach a contradiction 


2no(lv|/2) > 2rad(S) > |u — w| > E(S(2, 2, 2, V8, 2, 2), 2, 2, 2.77) 
= 2(1.453) = 2n0(|v|/2). 


We conclude that {u, w} passes through {0, v, w’}. O 


Lemma 9.8. Let F = {0, uy, u2} be a quasi-regular triangle. Let {0, v} be the diagonal 
of an upright quarter in the Q-system. The set A(v, W°) does not overlap the cone at 0 
over the triangle F. 


Proof. We prove the lemma for the subsets A~(v, W°) and R,, in two separate cases, 
beginning with A~(v, W°). Let S be the simplex {0, u;, u2, v}. 

Assume that the orientation of S along F is negative. The simplex S is an upright 
quarter, so that wu; and uz are anchors of v. This is contrary to the construction of the 
wedges W in W. Thus, the orientation of F must be positive. 

Assume that rad(S) < o(|v|/2). Then again, uw; and uz are anchors. It follows that 
S is an upright quarter. As in the previous paragraph, this is contrary to construction. 
Thus, rad($) > o(|v|/2). 

We now have that the orientation of F is positive and that rad(S) > no(|v|/2). These 
two facts allow us to separate A~(v, W°) from {0, uw, u2, v} as follows. Each interior 
point x of A~(v, W°) has 


€0({0, uy, U2, v}, x) = v. 


Let Fo be the set of points in the intersection of &2(0) with the convex hull of F. Each 
point y in Fo has 


éo({0, uj, U2, v}, y) € {u1, Ur}. 


Since &9 takes distinct values on these two sets, they are disjoint. 

Next consider the subset R,,, in the case that w € {u,, u2}. To be definite, assume that 
w = uy. The same argument as above establishes that the orientation of F is positive 
and that rad($) > no(\v|/2). Each interior point x of R,, has 


eq ({0, u1, U2, v}, x) =v. 
However, each point y in Fo has 
€9({0, uy, U2, v}, y) € {u1, uz}. 


This proves this case. 
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Until the end of the proof, we may assume that w ¢ {u1, U2}. If the value of 
€0({0, v1, U2, w},-) is w on interior points of R,, and in {u;, v2} on Fo, then we have 
separated the sets. Assume to the contrary, that ¢ takes value w at a point x of Fo. Let 
S” = {0, u;, U2, w}. This assumption implies that the orientation of S” is negative along 
F.. This in turn implies that S” is a quasi-regular tetrahedron. The vertex v is not enclosed 
over S”, because the simplices in the Q-system do not overlap. For similar reasons, the 
face {0, v, w} does not overlap the simplex S$”. By the previous case (when w € {u, U2}), 
the interior of R,, does not intersect the faces of the quasi-regular tetrahedron S” along 
the edge {0, w}. These facts imply that the interior of R,, is disjoint from S”. In particular, 
it does not meet the cone at 0 over the triangle F’. 

Assume finally that ¢ takes value uw; or uz at an interior point y of Ry. (Say 1.) 
Let S’ = {0, v, w, u;}. Assume that R,, lies on the same side of {0, v, w} as uy. If 
rad($’) < no(\v|/2), then each point of R, is obstructed from u, (by Lemma 9.7). 
However, no point of Fo is obstructed from u;. Thus, R,, and Fo are disjoint in this 
case. Assume that rad(S’) > 79(\v|/2). If the orientation of {0, v, w} in S’ is negative, 
then S’ is a quarter and the result follows. Assume that the orientation is positive. Now 
€9(S’, x) = w for x € R,,, contrary to assumption. 

We may now assume that R,, lies outside the simplex S’ on the opposite side of the 
face {0, v, w}. This case is dismissed by Lemma 5.31, which guarantees that the interior 
of R, with ¢ = u, are obstructed from u, by {0, v, w}. However, none of the points of 
Fo with ¢ = uy, are obstructed from 1. 


Corollary 9.9. Each A(v, W°) lies entirely in the cone over the standard region that 
contains {0, v}. 


Proof. The cone over a standard region is bounded by the cones over the quasi-regular 
triangles. O 


Lemma 9.10. Let F = {0, uj, u2} be a triangle. Assume that \u,| < 2to, |u2| < 2t, 
and 2ty < |uy — u2| < V/8. Let {0, v} be the diagonal of an upright quarter in the 
Q-system. Assume that if u, and uz are both anchors of v, then they are consecutive 
anchors around v. Under these conditions, the set A(v, W°) does not overlap the cone 
at 0 over the triangle F. 


Proof. The proof is similar to that of Lemma 9.8. We prove the lemma for the subsets 
A (vu, W°) and R, in two separate cases, beginning with A~(v, W°). Let S be the 
simplex {0, u1, U2, v}. The orientation of S along F is positive. 

Assume that rad($) < no(|v|/2). Then uw, and wz are anchors. By the hypotheses of 
the lemma, they are consecutive anchors. By the rules defining A~ (v, W°), there is no 
wedge W* between uw and uw. Thus, the result follows in this case. 

We now have that the orientation of F is positive and that rad(S) > no(|v|/2). These 
two facts allow us to separate A~(v, W°) from the cone over {0, v1, v2, v} as follows. 
Each interior point x of A~ (v, W°) has 


€9({0, uj, U2, v}, x) =v. 
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Let Fo be the intersection of (2(0) with the convex hull of F. Each point y in Fo has 
Eo({0, ui, U2, V}, y) € {u1, Ud}. 


Next consider the subset R,,, in the case that w € {u 1, u2}. To be definite, assume that 
w = uy. The same argument as above establishes that the orientation of F is positive 
and that rad($) > no(|v|/2). Each interior point x of R,, has 


e9({0, uy, U2, VU}, x) = vz 
However, each point y in Fo has 
€9({0, uj, U2, v}, y) € {u1, uy}. 


This proves this case. 

Consider the subset R,,, in the case that w ¢ {u;, uz}. As in the previous proof, if the 
value of €9({0, w1, u2, w}, -) 1s w on interior points of R,, and in {u;, u2} on Fo, then we 
have separated the sets. Assume first to the contrary, that ¢9 takes value w at a point x 
of Fo. Let S” = {0, uy, u2, w}. Our assumption on ép implies that the orientation of S” 
is negative along F, so that S” is a flat quarter. The vertex v cannot be enclosed over S$”, 
for otherwise w, u,, and u2 would all be anchors of v, which would mean that there is no 
region W € W. Similarly, the triangle {0, v, w} cannot overlap the triangle {0, w1, u2}, 
for otherwise w, u;, and uz would again be anchors, contrary to the hypothesis that wu 
and uy are consecutive anchors. Now we invoke Lemma 9.8, to establish that R,, does 
not intersect S” and is therefore disjoint from Fo. 

Assume finally, that ¢9 takes value wu; or uz at an interior point y of R,,. (Say 1.) 
This case is identical to the parallel case in the proof of Lemma 9.8. 


Lemma 9.11. Let F = {0, uj, uz} be a triangle. Assume that 2t» < |u| < J/8, 
2 < |u2| < 2t0, and2 < |u; —up2| < 2fp. Let {0, v} be the diagonal of an upright quarter 
in the Q-system. Under these conditions, the set A(v, W°) does not overlap the cone at 
0 over the triangle F. 


Proof. Let S = {0,u,, U2, v}. The orientation of S along {0, u;, u2} is positive. The 
circumradius rad(S) is at least 79(0, v, v1) > no(\v|/2). 

We now have that the orientation of F is positive and that rad($) > no(\v|/2). We 
can then argue as in the proof of Lemmas 9.8 and 9.10, to get the result for A~ (v, W°) 
and R,, (with w = u2). 

Consider the case R,,, with w ~ uz. As in these earlier proofs, we may assume that 
&9 takes value w at a point x of Fo (or that ep takes value in {u,, uv} at a point y of R,,). 

In the case €9 = w at x € Fo, let S” = {0, u1, w, uz}. We have that ¢9 = w implies 
that the orientation of S” along {0, u1, u2} is negative. This in turn implies that S” is an 
upright quarter. It is checked without difficulty that v is not enclosed over S” and that 
the face {0, w, v} does not cross the face {0, u;, v2}. It follows from Lemma 9.8 and 
the already treated cases of this lemma that R,, cannot intersect S”. Thus, it does not 
intersect the face {0, u), u2} of S”. 

Finally, assume that ¢9 takes value in {u;, u2} at a point y of R,,. The orientation 
of the face {0, v, w} is positive in the simplex {0, v, w, u;} and the circumradius of 
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{0, v, w, uj} is greater than y0(|v|/2). This implies that ¢9 does not take the value uw. 
Assume that €) = uz. This case is excluded in the same manner as the parallel case in 
the earlier Lemmas 9.8 and 9.10. 


Lemma 9.12. Let {0, v} be an upright diagonal of a quarter in the Q-system. If x lies 
in the interior of A(v, W°), then x is unobstructed at 0. 


Proof. For acontradiction, assume that x is obstructed at 0 by barrier T = {u1, U2, u3}. 

The convex hull of T can be partitioned into three sets T (i) depending on which vertex 
of T is closest to a given point in the convex hull. (Ties can be resolved in any consistent 
manner.) Let y € A(v, W°) be the point in the convex hull of T on the segment from 0 
to x. Fix i so that y € T(i). If v = u;, then each point y of T (7) is closer to v than to 
0. However, each point of A(v, W°) is closer to 0 than to v. So x is not obstructed by T 
at 0. 

We may now assume that v ¥ u;. 

Partition R? geometrically into three sets V(u;), V(O), V(v) according to which of 
{u;, 0, v} a point z € R? is closest to. (Again resolve ties in any consistent manner.) 

Assume further that max; uj; > 2f). This implies that y € T(i) C V(v) U V(u;). On 
the other hand, we have by construction that y € A(v, W°) C V (0). (There are two cases 
involved in this conclusion, depending on whether u; is an anchor of {0, v}.) However, 
the sets V(-) are disjoint; and we reach a contradiction. Thus, under these assumptions, 
x is unobstructed at 0. 

Next assume that max; uj < 2f9. Let S = {0, uj, uz, u3}. Since T is a barrier, S € Qo. 
By assumption, {0, v} is a diagonal of an upright quarter in Qo. By the nonoverlap of 
quarters in Qo, we see that v is not enclosed over S$. The wedge W° on the unit sphere 
is spherically star convex with respect to the center v/|v|. Thus, if A(v, W°) intersects 
the convex hull of T at y, then A(v, W°) intersects the cone over a face {0, u;, u2} of S 
at y’. (We can take y’/|y’| to lie on the cone generated by the arc running from v/|v| to 
y/|y|. This is impossible by Lemmas 9.8 and 9.10. 


Lemma 9.13. Let {0, v} be the upright diagonal of a quarter in the Qo-system. Then 
the interior of A(v, W°) is a subset of VC(O). 


Proof. We begin by showing that A~(v, W°) C VC(O). Suppose to the contrary, that 
a point x in the interior of A~ lies in VC(w), with w 4 0. Then x is closer to w than 
to 0. Thus, 7(0, v, w) < no(|v|/2), and w is an anchor of {0, v}. The face E,, in the 
construction A~(v, W°) prevents this from happening. 

Now consider a point x of R,,, which we assume to lie in VC(u), with u 4 0. To 
avoid a trivial case, we may assume that w + u. 

Assume that the orientation of S = {0, v, w, u} is negative along the face {0, v, w}. 
Then S$ must be an upright quarter. By the construction of wedges W € W, we have that 
R, must lie on the opposite side of the plane {0, v, w} from u (for there is no wedge 
between the anchors of an upright quarter). The result now follows from Lemma 5.31. 

If rad($) < 7o(|v|/2), then uv and w are anchors. In this case the result follows from 
Lemma 9.7. 
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Finally, if the orientation is positive and if rad($) > n(|v|/2), then a point of R, 
cannot be closer to uw than to 0. 


9.3. Overlap 
Lemma 9.14. The sets A(v, W°) do not overlap one another. 


Proof. This is clear for two sets around the same vertex v. Consider the sets A(u, W°) 
and A(v, W°) at u and v. 

To treat the points in A~ (u, W°) and A” (v, W°), we may contract {u, v} until }u—v| = 
2. By the constraints on the edges of {0, u, v}, the circumcenter c of this triangle lies in the 
convex hull of the triangle. We have n(0, u, v) > no(\v|/2) and n(0, u, v) > no(\ul/2). 
So the plane through {0, c} perpendicular to the plane {0, u, v} separates A~ (u, W°) 
from A™(v, W°). 

Next we separate points in A~ (u, W°) from points of R“, where w is an anchor of 
vandu # v. Let S = {0, u, v, w}. The orientation of S along {0, v, w} is positive. The 
circumradius of S satisfies 


rad(S) > (0, u, v) > no(lv|/2). 


Thus, €9(S, -) takes different values on A~ (u, W°) and R®, so that the sets are disjoint. 
Next we separate points of R“ from R”. (Notice that we assume that the anchor is 
the same for the two Rogers simplices.) Let S = {0, u, v, w}. As above, we have 


rad(S) > no(v|/2), — no(\w|/2). 


The simplex S has positive orientation along the faces {0, u, w} and {0, v, w}. Let c, be 
the circumcenter of {0, u, w}, let c, be the circumcenter of {0, v, w}, and let c be the 
circumcenter of S. Then R) lies in the convex hull of {0, w, cy, c}, but R“ lies in the 
convex hull of {0, w, c,, c}. Thus, the sets are disjoint. 

Finally, we separate points of R“ from points of RY , Where w 4 w’ and u # v. If 
the function €9({0, w, w’}, -) separates the sets, we are done. Otherwise, we may assume 
say that e9({0, w, w’}, x) = w’ from some x € R“. Let S = {0, uv, w, w’}. 

If w’ is not an anchor of u, then rad($) > n9(\u|/2) and the orientation of S along 
{0, w, u} is positive. In this case we have ¢9 = w on R™, which is contrary to the 
assumption. Thus, we may assume that w’ is an anchor of uw. 

If the orientation of {0,u, w, w’} is negative along {0, w,u}, then {0, u, w, w’} 
is a quarter, contrary to the existence of W € W. So the orientation is positive. 
If rad({0, u, w, w’}) < no(\u|/2), then Lemma 9.7 implies that each point of R,, 
is obstructed from w’. However, no point of RY is obstructed from w. (In fact, a 
barrier that crosses A(v, W°) is inconsistent with Lemmas 9.8, 9.10, and 9.11.) So 
rad({0, u, w, w’}) > no(\u|/2). This is contrary to e9({0, w, w’}, x) = w’ from some 
xe Re, 
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9.4. The S-System Defined 


We consider three types of simplices: A, B, and C. Each type has its vertices at vertices 
of the packing. The edge lengths of these simplices are at most 2/2. 
Type A. This family consists of simplices S(y1, ..., yo) whose edge lengths satisfy 


V1, Y2, ¥3 € [2, 20], ya, Ys € [2f9, 2.77], 
yo €[2,2t], and n(ya, ys, ye) < V2. 


(These conditions imply y4, ys < 2.697, because 7(2.697, 2t, 2) > /2.) 

Type B. This family consists of certain flat quarters that are part of an isolated pair 
of flat quarters. It consists of those satisfying y2, y3 < 2.23, ya € [2tp, 24/2]. 

Type C. This family consists of certain simplices S(y1,..., y6) with edge lengths 
satisfying y;, y4 € [2f, 2/2], Y2, ¥3, 5, Yo € [2, 2f9]. We impose the condition that the 
first edge is the diagonal of some upright quarter in the Q-system, and that the upper 
endpoints of the second and third edges (that is, the second and third vertices of the 
simplex) are consecutive anchors of this diagonal. We also assume that y4 < 2.77, or 
that both face circumradii of S along the fourth edge are less than /2. 


Lemma 9.15. [fa vertex w is enclosed over a simplex S of type A, B, or C, then its 
height is greater than 2.77. Also, {0, w} is not the diagonal of an upright quarter in the 
Q-system. 


Proof. Incase A, n(y4, ys, Yo) < V2, so an enclosed vertex must have height greater 
than 2,/2. It is too long to be the diagonal of a quarter. 

In case B we use the fact that the isolated quarter does not overlap any quarter in 
the Q-system. We recall that a function €, defined in Section 4.2, measures the distance 
between opposing vertices in a pair of simplices sharing a face. An enclosed vertex has 
length at least 


ECS, 2, 2, 20/2, Din, Dt); 2ig: 2,2) > 2.77, 


By the symmetry of isolated quarters, this means that the diagonal of a flat quarter must 
also be at least 2.77. 

In case C the same calculation gives that the enclosed vertex w has height at least 
2.77. Let the simplex S' be given by {0, v, v1, v2}, where {0, v} is the upright diagonal. 
By Lemma 4.24, v; and v2 are anchors of {0, w}. The edge between w and its anchor 
cannot cross {v, v;} by Lemma 4.19. (Recall that two sets are said to cross if their radial 
projections overlap.) The distance between w and v is at most 2% by Lemma 4.32. If 
{0, w} is the diagonal of an upright quarter, the quarter takes the form {0, w, v1, v3}, or 
{0, w, v2, v3} for some v3, by Lemma 4.32. If both of these are quarters, then the diagonal 
{v;, V2} has four anchors v, w, 0, and v3. The selection rules for the Q-system place the 
quarters around this diagonal in the Q-system. So neither {0, w, v,, v3} nor {0, w, v2, v3} 
is in the Q-system. Suppose that {0, w, v;, v3} is a quarter, but that {0, w, v2, v3} is not. 
Then {0, w, v1, v3} forms an isolated pair with {v;, v2, v, w}. In either case the quarters 
along {0, w} are not in the Q-system. 
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Remark 9.16. The proof of this lemma does not make use of all the hypotheses 
on C. The conclusion holds for any simplex S(y1,..., yo), with yj, ya € [2f, 2/2], 
25 ¥3, Ys, Yo € [2, 2h]. 


9.5. Disjointness 


Let S = {0, v1, v2, v3} be a simplex of type A, B, or C. An edge {v4, vs} of length at 
most 2/2 such that |va|, |vs| < 2f cannot cross two of the edges {v;, vj} of S. In fact, it 
cannot cross any edge {v;, vj} with |v; |, |vj| < 2% by Lemma 4.30. The only possibility 
is that the edge {v4, vs} crosses the two edges with endpoint v;, with |v;| > 2fo in case 
C. However, this too is impossible by Lemma 4.372. 

Similar arguments show that the same conclusion holds for an edge {v4, vs} of length 
at most 2/9 such that |v4| < 2t0, vs < 2/2. The only additional fact that is needed is that 
{v4, vs} cannot cross the edge between the vertex v of an upright diagonal {0, v} and an 
anchor (Lemma 4.19). 


Lemma 9.17. Consider two simplices S, S’, each of type A, B, C, or a quarter in the 
Q-system. Assume that S and S' do not lie in the cone over a quadrilateral region. Then 
S and S' do not overlap. 


Proof. By hypothesis, the standard region is not a quadrilateral, and we thus exclude 
the case of conflicting diagonals in a quad cluster. We claim that no vertex w of S is 
enclosed over S’. Otherwise, w must have height at least 20, so that {0, w} is the diagonal 
of an upright in the Q-system, and this is contrary to Lemma 9.15. Similarly, no vertex 
of S’ is enclosed over S. 

Let {v, v2} be an edge of S crossing an edge {v3, v4} of S’. By the preceding remarks, 
neither of these edges can cross two edges of the other simplex. The endpoints of the 
edges are not enclosed over the other simplex. This means that one endpoint of each 
edge {v1, v2} and {v3, vg} is a vertex of the other simplex. This forces S and S’ to 
have three vertices in common, say 0, v2, and v3. We have S = {0, v1, v3, v2} and 
S’ = {0, v3, vo, va}. If |v2| € [2¢0, 2/2], then we see that the anchors v3, v4 of {0, v2} 
are not consecutive. This is impossible for simplices of type C and upright quarters. 
Thus, v2 and v3 have height at most 29. We conclude, without loss of generality, that 
|v4| € [2t0, 2/2] and |v; — vo] > 2zp. 

The heights of the vertices of S are at most 2%, so it has type A or B, or it is a 
flat quarter in the Q-system. If S’ is an upright quarter in the Q-system, then it does 
not overlap an isolated quarter or a flat quarter in the Q-system, so S has type A. This 
imposes the contradictory constraints on A: 


297 = lw = tel BEBO. 2 22, 2h, 2), 2,2: Dy & B77. 


Thus S’ has type C. This forces S to have type A. We reach the same contradiction 
2.77 > E > 2.77. 
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9.6. Separation of Simplices of Type A 


Let Vs = VC(0) 1 C(S), for a simplex S of type A, B, or C. We truncate Vs to Vs(ts) 
by intersecting Vs with a ball of radius ts. The parameters ts depend on S. 

If S has type A, we use ts = +00 (no truncation). If v is enclosed over S$ = 
{0, v1, V2, v3}, then since (v1, v2, v3) < /2, the face {VU , V2, v3} has positive orien- 
tation for S and {v, v1, v2, v3}. This implies that the V-cells at v and 0 do not intersect, 
and there is no need to truncate. If a simplex adjacent to S has negative orientation along 
a face shared with A, then it must be a quarter Q = {0, v4, v1, v2} (Lemma 5.13) or a 
quasi-regular tetrahedron. It cannot be an isolated quarter because of the edge length 
constraint 2.77 on simplices of type A. If it is in the Q-system, the face between S and 
the adjacent simplex is a barrier, and it does not interfere with the V-cell over S. Assume 
that it is not in the Q-system. There must be a conflicting diagonal {0, w}, where w is 
enclosed over Q. (w cannot be enclosed over S by results of Lemma 9.17.) This shields 
the V-cell at v4 from C(S) by the two barriers {0, w, v;} and {0, w, v2} of quarters in 
the Q-system. 

This shows that nothing external to a simplex of type A affects the shape of Vs (ts) 
(that is, VC(0) 1 C(S) consists of points of S that are closer to 0 than to the other vertices 
of S). Thus, Vs(ts) can be computed from S alone. Similarly, Vs (ts) does not influence 
the external geometry, since all faces have positive orientation. 

We also remark that Vs(ts) does not overlap any of the sets A(v, W°). This is evident 
from Lemmas 9.8 and 9.10. 

Our justification that Vs (ts) can be treated as an independently scored entity is now 
complete. 


9.7. Separation of Simplices of Type B 


If SQ, ..-, ye) has type B, we label vertices so that the diagonal is the fourth edge, 
with length y4. We set ts = 1.385. The calculation of € in Lemma 9.15 shows that any 
enclosed vertex over S has height at least 2.77 = 2ts. 

Vertices outside C(S) cannot affect the shape of Vs(ts). In fact, such a vertex v’ 
would have to form a quarter or quasi-regular tetrahedron with a face of S. The V-cell 
at v’ cannot meet C(S) unless it is a quarter that is not in the Q-system. However, by 
definition, an isolated quarter is not adjacent (along a face along the diagonal) to any 
other quarters. 

To separate the scoring of Vs(ts) from the rest of the standard cluster, we also show 
that the terms of formula (7.12) for Vs (ts) are represented geometrically by solids that lie 
in the cone C(S). This is more than a formality because S can have negative orientation 
along the face F formed by the origin and the diagonal (the fourth edge). 


Definition 9.18. Let 6,,(0) € [0, 2/2] be defined by the equations 
cos” py = (cos? wr cos? é)/A - cos” 0), for w<4@. 


Let p and q be points on the unit sphere separated by arclength 0. If we place a spherical 
cap of arcradius y on the unit sphere centered at p, then By, (@) is the angle at g between 
the arc (q, p) and the tangent to the cap which passes through q. 
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Let S = {0, vj, v2, v3}, where v; is the endpoint of the ith edge. We establish that 
the solids representing the conic and Rogers terms of formula (7.12) lie over C(S) by 
showing? that By (arc(y1, y3, ys)) < dih3(S(y1,...,¥6)), where dih3 is the dihedral 
angle along the third edge. We use cos fr = y,/2.77 and assume yo, y3 € [2, 2.23]. 

The reasons given in Section 9.6 for the disjointness of 5(v) and Vs(ts) apply to 
simplices of type B as well. This completes the justification that Vs(ts) is an object that 
can be treated in separation from the rest of the local V-cell. 


9.8. Separation of Simplices of Type C 


If SQ, ..., Ye) is of type C, we label vertices so that the upright diagonal is the first 
edge. We use ts = +00 (no truncation). Each face of S has positive orientation by 
Lemma 5.13. So Vs(ts) C S. 

Vertices outside S cannot affect the shape of Vs (ts). Any vertex v’ would have to form 
a quarter along a face of S. If the shared face lies along the first edge, it is a quarter Q in 
the Q-system, because one and hence all quarters along this edge are in the Q-system. 
The faces of this quarter are then barriers. If the shared face lies along the fourth edge, 
then its length is at most 2.77, so that the quarter cannot be part of an isolated pair. 
If it is not in the Q-system, there must be a conflicting diagonal. The two faces along 
this conflicting diagonal of the adjacent pair in the Q-system (that is, the pair taking 
precedence over Q in the Q-system) are barriers that shield the V-cell at v’ from S. 

The reasons given in Section 9.6 for the disjointness of 5(v) and Vs(ts) apply to 
simplices of type C as well. This completes the justification that Vs(ts) is an object that 
can be treated in separation from the rest of the local V-cell. 


9.9. Simplices of Type C' 


We introduce a small variation on simplices of type C, called type C’. We define a 
simplex {0, v, v1, va} of type C’ to be one satisfying the following conditions: 


1. The edge {0, v} is an upright diagonal of an upright quarter in the Q-system. 
2. |v2| € [2.45, 2¢o]. 

3. vy and v2 are anchors of v. 

4. |v — v»| € [2.45, 2¢0]. 

5. The edge {v, v2} is a diagonal of a flat quarter with face {0, v1, v2}. 


It follows that v; and v2 are consecutive anchors of {0, v}. 

On simplices S of type C’, we label vertices so that the upright diagonal is the first 
edge. We use ts = +00 (no truncation). Each face of S has positive orientation by 
Lemma 5.13. So Vs(ts) C S. 

Simplices of type C’ are separated from quarters in the Q-system and simplices of 
types A and B by procedures similar to those described for type C. The following lemma 
is helpful in this regard. 


9 CALC-193836552. 
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Lemma 9.19. The flat quarter along the face {0, v1, v2} is in the Q-system. 


Proof. 

E(S(2, 2, 2.45, 2V2, 2t0, 2to), 2, 2, 2) > 2V2, 
so nothing is enclosed over the flat quarter. 

E(S(2, 2, 2, 20/2, 2to, 2t0), 2p, 2.45, 2) > 2/2, 


so no edge between vertices of the packing can cross inside the anchored simplex. This 
implies that the flat quarter does not have a conflicting diagonal and is not part of an 
isolated pair. 


Similar arguments show that there is not a simplex with negative orientation along 
the top face of S. 

Unlike the other cases, there can in fact be overlap between A(v, W°) anda simplex of 
type C’, when the upright diagonal of the simplex is {0, v}. This is because the conditions 
defining a wedge W € W are not incompatible with the conditions defining type C’. 
Nevertheless, except in the obvious case where the simplex of type C’ and the wedge 
are both constructed between the same consecutive anchors of {0, v}, there can be no 
overlap of a A(v, W°) with a simplex of type C’. 


9.10. Scoring 


The construction of the decomposition of the V-cell VC(O) is now complete. It consists 
of the pieces 


e 5(v), for each diagonal {0, v} of an upright quarter in the Q-system, 

e truncations of Voronoi pieces Vs(t;) for simplices of type A, B, or C (and on rare 
occasion C’), 

e V°(to), the truncation at fo of all parts of VC(O) that do not lie in any of the cones 
C(S) over simplices of type A, B or C, 

e 6’, the part not lying in any of the preceding. 


By the results of Sections 9.6—9.8, o (D) can be broken into a corresponding sum, 


Or(D) = > o0(Q)+a(Vp), for quarters Q in the Q-system, where 
Q 


o(Vp) = c-vor(V> (to) + » c-vor(Vs(ts)) 


A,B,C 


— J > 48,<:vol(5p(v)) — 48cctVol(5'p). 


By dropping the final term, 46 .:vol(6;,), we obtain an upper bound ono (Vp). Because 
of the separation results of Sections 9.6—-9.7, we may score Ve (to) by formula (7.13). 
Bounds on the score of simplices of type B appear in CALC-193836552. 


164 


Sphere Packings, III 99 


Lemma 9.20. Let R be a standard region that is not a triangle in a decomposition start se 
D. t,r(D) = 0. 


Proof. Everything truncated at f) can be broken into three types of pieces: Rogers 
simplices R(a, b, to), wedges of to-cones, and spherical regions. (See Fig. 8.1.) The 
wedges of fo-cones and spherical regions can be considered as the degenerate cases b = fg 
and a = b = 9 of Rogers simplices, so it is enough to show that t(R(a, b, to)) > 0. We 
have to > ./3/2, so by Rogers’s lemma [Ha6, Lemma 8.6.2], 


T(R(a, b, to)) > TRC, (2, 2, 2), 3/2)). 


The right-hand side is zero. (In fact, the vanishing of the right-hand side is essentially 
Rogers’s bound. When Rogers’s bound is met, t = 0.) 


10. Bounds on the Score in Triangular and Quadrilateral Regions 
10.1. The Function t 


We consider the functions or(D) — A¢ sol(R) pt, for 7 = 0, 1, or 3.2, where R is a 
standard cluster. We write 


te(D) = sol(R)g pt — or(D). 


We will see that tg(D) has a simple interpretation. If D is a decomposition star with 
standard clusters {R}, set t(D) = }°p Tr(D). 


Lemma 10.1. tr(D) > 0, for all standard clusters R. 


Proof. If R is not a quasi-regular tetrahedron, then or(D) < 0 by Theorem 8.4 and 
sol(R) > 0, so that the result is immediate. If R is a quasi-regular tetrahedron, the result 
appears in the archive of inequalities CALC-53415898. O 


Lemma 10.2. 


o(D) = 4r¢ pt— t(D). 


Proof. Let {R} be the standard clusters in D. Then 


o(D) =) or, (D) + G -S s(t) ¢ pt=4nt pt— )- tr, (D). 
R R R 


Since 22.8 > 4¢ and 14.8 pt > 42¢ pt — 8 pt, we find as an immediate corollary 
that if there are standard clusters satisfying tr,(D) + --:+Tr,(D) => 14.8pzt, then D 
does not contravene. 

The function tr(D) gives the amount squandered by a particular standard cluster 
R. If nothing is squandered, then tr,(D) = 0 for every standard cluster, and the upper 
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bound is 47r¢ pt © 22.8 pt. To say that a decomposition star does not contravene is to 
say that at least (47 ¢ — 8) pt 14.8 pt are squandered. 


Remark 10.3. (This remark is not used elsewhere.) The bound o(D) < 4z¢ pt im- 
plies Rogers’s bound on density. It is the unattainable bound that would be obtained 
by packing regular tetrahedra around a common vertex with no distortion and no gaps. 
(More precisely, in the terminology of [Hal], the score sy = 47¢ pt corresponds to the 
effective density 167 8oc./(161 — 359) = V2/¢ © 0.7796, which is Rogers’s bound.) 
Every positive lower bound on some Tr (D) translates into an improvement on Rogers’s 
bound. 


Lemma 10.4. A triangular standard region does not contain any enclosed vertices. 


Proof. This fact is proved in [Ha6, Lemma 3.7]. 


10.2. Types 


Let v be a vertex of height at most 2%). We say that v has type (p, q) if every standard 
region with a vertex at v (the radial projection of v) is a triangle or a quadrilateral, and 
if there are exactly p triangular faces and q quadrilateral faces that meet at v. We write 
(Pv, dv) for the type of v. 

This section derives the bounds on the scores of the clusters around a given vertex as 
a function of the type of the vertex. Define constants tLp(p, g)/pt by Table 10.1. The 
entries marked with an asterisk will not be needed. 


Lemma 10.5. Let S;,...,S, and R,,...,.Rq be the tetrahedra and quad clusters 
around a vertex of type (p, q). Consider the constants of Table 10.1. Now, 


Pp 


q 
Yo (5) + Do te, (D) = tLP(. 4). 


Table 10.1. tLp(p, q)/pt 


q 
0 r 2 3 4 : 

p=0 * * 15.18 7.135 10.6497 22.27 
1 * * 6.95 7.135 17.62 32. 
2 * 8.5 4.756 12.9814 * * 
3 * 3.6426 8.334. 20.9 * * 
4 4.1396 3.7812 16.11 * * * 
5 0.55 11.22 * * * * 
6 6.339 * ‘ * * * 
7 14.76 * * * * * 
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Proof. Set 


(d}, t?) = ih(S;), T(S;)), dt) = (dih(R;), T(Ri)). 

The linear combination )\” t(S;)+- Tr, (D) is at least the minimum of )~? 1? +>? #} 

subject to >? d?+)~4 d} = 27 and to the system of linear inequalities CALC-830854305 

and the system of linear inequalities CALC-940884472 (obtained by replacing t and 

dihedral angles by t/ and d}). The constant trp (p, g) was chosen to be slightly smaller 

than the actual minimum of this linear programming problem. 
The entry t_p(5, 0) is based on Lemma 10.6, k = 1. 


Lemma 10.6. Let v1,..., vx, for some k < 4, be distinct vertices of a decomposition 
star of type (5,0). Let S,,..., 8S, be quasi-regular tetrahedra around the edges {0, v;}, 
fori <k. Then 
Y > r(S;) > 0.55k pt 
i=l 
and 
> o(S;) <r pt — 0.48k pt. 


i=1 


Proof. We have t(S) > 0, for any quasi-regular tetrahedron S. We refer to the edges 
y4, Ys, Yo Of a simplex S(}1,..., Yo) as its top edges. Set € = 2.1773. 

The proof of the first inequalities relies on seven calculations.!° Throughout the proof, 
we refer to these inequalities simply as inequality i, fori = 1,...,7. 

We claim (Claim 1) that if $;,...,.S5 are quasi-regular tetrahedra around an edge 
{0, v} andif $; = S(y1,..., v6), where ys > & is the length of a top edge e on S; shared 
with S$», then yy T(S;) > 3(0.55) pt. This claim follows from inequalities | and 2 if 
some other top edge in this group of quasi-regular tetrahedra has length greater than &. 
Assuming all the top edges other than e have length at most &, the estimate follows from 
x dih(S;) = 27 and inequalities 3 and 4. 

Now let S;,..., Sg be the eight quasi-regular tetrahedra around two edges {0, v1}, 
{0, v2} of type (5, 0). Let S, and Sz be the simplices along the face {0, v;, v2}. Suppose that 
the top edge {v;, v2} has length at least €. We claim (Claim 2) that x T(S;) > 4(0.55) pt. 
If there is a top edge of length at least € that does not lie on S; or $2, then this claim 
reduces to inequality | and Claim 1. If any of the top edges of S; or Sz other than {v1, v2} 
has length at least €, then the claim follows from inequalities | and 2. We assume 
all top edges other than {v;, v2} have length at most €. The claim now follows from 
inequalities 3 and 5, since the dihedral angles around each vertex sum to 27. 

We prove the bounds for t. The proof for o is entirely similar, but uses the constant 
€ = 2.177303 and seven new calculations!! rather than the seven given above. Claims 
analogous to Claims | and 2 hold for the o bound by this new group of seven inequalities. 


10 CaLc-636208429. 
'l CaLc-129662166. 
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Consider t for k = 1. If a top edge has length at least &, this is inequality 1. If all top 
edges have length less than &, this is inequality 3, since dihedral angles sum to 277. 

We say that a top edge lies around a vertex v if it is an edge of a quasi-regular 
tetrahedron with vertex v. We do not require v to be the endpoint of the edge. 

Take k = 2. If there is an edge of length at least € that lies around only one of v; and 
v2, then inequality | reduces us to the case k = 1. Any other edge of length at least & is 
covered by Claim 1. So we may assume that all top edges have length less than €. Then 
the result follows easily from inequalities 3 and 6. 

Take k = 3. If there is an edge of length at least € lying around only one of the 
v;, then inequality 1 reduces us to the case k = 2. If an edge of length at least & lies 
around exactly two of the v;, then it is an edge of two of the quasi-regular tetrahedra. 
These quasi-regular tetrahedra give 2(0.55) pt, and the quasi-regular tetrahedra around 
the third vertex v; give 0.55 pt more. If a top edge of length at least € lies around all three 
vertices, then one of the endpoints of the edge lies in {v;, v2, v3}, so the result follows 
from Claim 1. Finally, if all top edges have length at most €, we use inequalities 3, 6, 
and 7. 

Take k = 4. Suppose there is a top edge e of length at least &. If e lies around only 
one of the v;, we reduce to the case k = 3. If it lies around two of them, then the two 
quasi-regular tetrahedra along this edge give 2(0.55) pt and the quasi-regular tetrahedra 
around the other two vertices v; give another 2(0.55) pt. If both endpoints of e are among 
the vertices v;, the result follows from Claim 2. This happens in particular if e lies around 
four vertices. If e lies around only three vertices, one of its endpoints is one of the vertices 
vu;, say v;. Assume eé is not around v2. If v2 is not adjacent to v;, then Claim 1 gives the 
result. So taking v; adjacent to v2, we adapt Claim 1, by using all seven inequalities, to 
show that the eight quasi-regular tetrahedra around v, and v2 give 4(0.55) pt. Finally, if 
all top edges have length at most €, we use inequalities 3, 6, and 7. 


In a special case, the constant of Lemma 10.6 can be improved by a small amount. 
This small improvement will be used in Paper V. 


Lemma 10.7. Let v be a vertex of a decomposition star of type (5,0). Let S,,..., Ss 
be quasi-regular tetrahedra around the edge {0, v}. Then 


B) 
y\o(5)) < 4.52 pt— 107°. 


i=l 


Proof. If any of the top edges has length greater than €, we use a slightly improved 
calculation!” that yields this constant. Otherwise, the same calculation!? that was used 
in the previous lemma gives the desired estimate 


2, < 5(0.31023815) — 2 (0.207045) < 4.52 pt— 10°°. 


12 CaLc-241241504-1. 
13 CaLc-82950290. 
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10.3. Limitations on Types 


Recall that a vertex of a planar map has type (p,q) if it is the vertex of exactly p 
triangles and g quadrilaterals. This section restricts the possible types that appear in a 
decomposition star. 

Let t4 denote the constant 0.1317 © 2.37838774 pt. 


Lemma 10.8. Jf R is a quad cluster, then 


TR(D) = ta. 
Proof. A calculation" asserts precisely this. O 


Lemma 10.9. The following eight types (p,q) are impossible: (1) p => 8, (2) p => 6 
and q > 1, (3) p 2 5 andq > 2, (4) p= 4 and q > 3, (5) p = 2andq > 4, (6) p= 0 
and q = 6,(7) p < 3 andq =0,(8) p < landq =1. 


Proof. Calculations'> give a lower bound on the dihedral angle of p simplices and q 
quadrilaterals at 0.8638 p + 1.153g and an upper bound of 1.874445 p + 3.247q. If the 
type exists, these constants must straddle 27. One readily verifies in Cases 1-8 that these 
constants do not straddle 277. 


Lemma 10.10. Jf the type of any vertex of a decomposition star is one of (4, 2), (3, 3), 
(1, 4), (1, 5), (0, 5), (0, 2), then the decomposition star does not contravene. 


Proof. According to Table 10.1, we have tLhp(p,q) > (4¢ — 8)pt, for (p,q) = 
(4, 2), (3, 3), , 4), C1, 5), (0, 5), or (0, 2). By Lemma 10.2, the result follows in these 
cases. 


Remark 10.11. In summary of the preceding two lemmas, we find that we may restrict 
our attention to the following types of vertices: 


(7, 0) 
(6, 0) 
(5,0) (5,1) 
(4,0) 4,1) 

(3,1) (3,2) 

(2,1) (2,2) (2,3) 

d,2) (d,3) 
(0,3) (0,4) 


It will be shown in Corollary 12.3, that the type (7, 0) does not occur in a contravening 
decomposition star. 


'4 CaLC-996268658. 
'5 CALC-657406669, CALC-208809199, CALC-984463800, and CALC-277330628. 
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10.4. Bounds on the Score in Quadrilateral Regions 


If the quad cluster has a diagonal of length at most /8 between two corners, there are 
three possible decompositions. (1) The two quarters formed by the diagonal lie in the 
Q-system so that the scoring rules for the Q-system are used. (2) There is a second 
diagonal of length at most \/8, and we use the two quarters from the second diagonal for 
the scoring. (3) There is an enclosed vertex that makes the quad cluster into a quartered 
octahedron and the four upright quarters are in the Q-system. 

Now suppose that neither diagonal is less than \/8 and the quad cluster is not a 
quartered octahedron. If there is no enclosed vertex of length at most /8, the quad 
cluster contains no quarters. An upper bound on the score of the quad cluster (P, D) is 
vorp(D, /2). The remaining cases are called mixed quad clusters. Mixed quad clusters 
enclose a vertex of height at most \/8 and do not contain flat quarters. 


Lemma 10.12. Assume a figure exists with vertices v,,..., 04, v subject to the con- 
straints 

2 < |u;| < 200, 

2 < |v; — visil < 2%, 

2 < |v; — vj421, 

h, < |v—-v;l, 

2 < |v| < 2h, for i=1,...,4 (mod4), 


where h; are fixed constants that satisfy h; € [2,2 /2). Let L be the quadrilateral on 
the unit sphere with vertices v;/|v;| and edges running between consecutive vertices. 
Assume that v lies in the cone at the origin obtained by scaling L. Then another figure 
exists made of a (new) collection of vectors v1,..., v4 and v subject to the constraints 
above together with the additional constraints 
lu; — vi4i| = 20, 
|u;| = 2, for i=1,...,4, 
|v] = 2%. 


Moreover, the quadrilateral L may be assumed to be convex. 


Proof. This lemma in pure geometry is a special case of Lemma 4.3 of [Ha6]. 


Lemma 10.13. A quadrilateral region does not enclose any vertices of height at 
most 2to. 


Proof. Let v;,..., v4 be the corners of the quad cluster, and let v be an enclosed 
vertex of height at most 27). We cannot have |v; — v| < 2% for two different vertices 
v;, because two such inequalities would partition the region into two separate standard 
regions instead of a single quadrilateral region. 


170 


Sphere Packings, II 105 


We apply Lemma 10.12 to assume 


|v; — vi41] = 210, |v;| = 2, |v| = 2t0, 
fori = 1,...,4. Reindexing and perturbing v as necessary, we may assume that 2 < 
|v; — vl < 2% and |v; — v| > 2%, fori = 2,3,4. Moving v, we may assume it 


reaches the minimal distance to two adjacent corners (2 for v; or 2% for v;,i > 1). 
Keeping v fixed at this minimal distance, perturb the quad cluster along its remaining 
degree of freedom until v attains its minimal distance to three of the corners. This is a 
rigid figure. There are four possibilities depending on which three corners are chosen. 
Pick coordinates to show that the distance from v to the remaining vertex violates its 
inequality. 


Lemma 10.14. The score of a mixed quad cluster is less than —1.04 pt. 


Proof. Any enclosed vertex in a quad cluster has length at least 2t9 by Lemma 10.13. 
In particular, the anchors of an enclosed vertex are corners of the quad cluster. There are 
no flat quarters. 

We generally truncate the V-cell at s/2 as in the proof of Theorem 8.4. By that lemma, 
it breaks the V-cell into pieces whose score is nonpositive. Thus, if we identify certain 
pieces that score less than —1.04 pt, the result follows. Nevertheless, a few simplices 
will be left untruncated in the following argument. We will leave a simplex untruncated 
only if we are certain that each of its faces has positive orientation and that the simplices 
sharing a face F with S either lie in the Q-system or have positive orientation along F. 
If these conditions hold, we may use!® the function s-vor on S rather than truncation 
S-VOT9. 

In this proof, by enclosed vertex, we mean one of height at most 2/2. Let v be 
an enclosed vertex with the fewest anchors. If there are no anchors, the right circular 
cone C(h, no(h)) (aligned along {0, v}; see Definition 7.22) belongs to VC(O), where 
no(h) = n(2h, 2, 29) as in Definition 9.1 and |v| = 2A. In fact, if such a point lies 
in VC(u), with wu ~ v, then uw must be a corner of the quad cluster or an enclosed 
vertex of height at least 2/9. In either case the right circular cone belongs to VC(0). By 
formula (7.11), the score of this cone is 27 (1 — h/no(h))p(h, no(h)). An optimization 
in one variable gives an upper bound of —4.52 pt, for t9 < h < V2. This gives the bound 
of — 1.04 pr in this case. 

If there is one anchor, we cut the cone in half along the plane through {0, v} per- 
pendicular to the plane containing the anchor and {0, v}. The half of the cone on 
the far side of the anchor lies under the face at v of the V-cell. We get a bound 
of —4.52 pt/2 < —1.04 pt. 

To treat the remaining cases, we define a function K (S) on certain simplices S with 
circumradius at least /2. Let S = S(y1, Y2,---, 6). Let R(a, b,c) denote a Rogers 
simplex. Set 


K(S) = Ko(y1, y2, yo) + Ko(v1, y3, Ys) + dih(S)(1 — y:/V8)9(1/2, V2), (10.1) 


16 CALC-185703487, CALC-69785808, and CALC-104677697. 
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Fig. 10.1. The set measured by the function K(S). 


where 


KoO1, y2; Yo) = r-vor(R(y1/2, n(y1, y2; Yo), J2)) 
+ r-vor(R(y2/2, (v1, y2, Ye), V2)) 
— dih(R(y1/2, n(1, y2, ye), V2) — y1/V8)9(1/2, V2). 


(If the given Rogers simplices do not exist because the condition 0 < a < b < cis 
violated, we set the corresponding terms in these expressions to 0.) The function K (S$) 
represents the part of the score coming from the four Rogers simplices along two of the 
faces of S, and the conic region extending out to /2 between the two Rogers simplices 
along the edge y, (Fig. 10.1). This region is closely related to the solids A(v, W°) of 
Section 9.3, with the difference that the solids A lie in a ball of radius 79(|v|/2), but the 
solids here are truncated at /2. 

In the remaining cases, each enclosed vertex has at least two anchors. Each anchor 
is a corner of the quad cluster. Fix an enclosed vertex v. Suppose that v;, a corner, is 
an anchor of v. Assume that the face {0, v, v;} bounds at most one upright quarter. We 
sweep around the edge {0, v,}, away from the upright quarter if there is one, until we 
come to another enclosed vertex v’ such that {0, v;, v’} has circumradius less than /2 
or such that v; is an anchor of {0, v’}. If such a vertex v’ does not exist, we sweep all the 
way to v2 acorner of the quad cluster adjacent to v1. 

If v’ exists, then various calculations!’ give the bound —1.04 pt, depending on the 
size of the circumradius of {0, v, v’}. This allows us to assume that we do not encounter 
such an enclosed vertex uv’ whenever we sweep away, as above, from the face formed by 
an anchor. 

Now consider the simplex S = {0, v;, v2, v}, where v; is an anchor of {0, v}. We 
assume that it is not an upright quarter. There are three alternatives. The first is that 
S decreases the score of the quarter by at least 0.52 pt. Calculations!* show that this 
occurs if the circumradius of the face {0, v, v2} is less than af 2: or if the circumradius 
of the face is greater than 2, provided that the length of {v, v;} is at most 2.2. The 
second alternative! is that the face {0, v, v1} of S is shared with a quarter Q and that 
S and Q taken together bring the score down by 0.52 pt. In fact, if there are two such 
simplices S and S’ along Q, then the three simplices Q, S, and S’ pull the score”? below 


!7 CaALc-104677697, CALC-69785808, CALC-586706757, and CALC-87690094. 
18 CaLc-185703487 and CALC-441195992. 

'9 CaLc-848147403, CALC-969320489, and CALC-975496332. 

20 CALC-766771911. 
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—1.04 pt. The third alternative is that there is a simplex S’ = {0, v, v, v3} sharing the 
face {0, v, v;}, which, like S, scores less than —0.31 pt. In each case, S and the adjacent 
simplex through {0, v, v;} score less than —0.52 pt. Since v has at least two anchors, the 
quad cluster scores less than 2(—0.52) pt = —1.04 pt. oO 


10.5. A Volume Formula 


In Definition 9.3 we found a solid 6(v, W°) that lies outside the ball of radius fp at 0 but 
inside VC(0). We now develop a formula for its volume. 
Set Go = (to, to) © —0.5666. We define 


crown(h) = 27(1 — h/no(h))(g(h, no(h)) — Go). (10.2) 


It is equal to —46o¢¢ times the volume of the region outside the sphere of radius fo and 
inside the finite cone C(h, no(h)). If v is an enclosed vertex of height 2h € [2to, /8], 
such that every other vertex v’ of the standard cluster satisfies 


n(luj, v'l, lv — v'|) = no(h), 


then the solid represented by crown(|v|/2) lies outside the truncated V-cell, but inside 
the V-cell, so that if P is a quad cluster, 


c-vor(Vp) < c-voro(Vp) + crown(|v|/2). 


If a vertex v’ satisfies n(|v|, |v’|, |v — v’|) < no(h), then by the monotonicity of the 
circumradius of acute triangles, v’ is an anchor of v. This anchor clips the crown just 
defined, and we add acorrection term anc(|v’|, |v|, |v—v"|) toaccount for this. Figure 10.2 
illustrates the terms in the definition of anc(). 


Set 
anc(y}, Y2, Ye) = — dih(R,) crown(y/2)/(27) — sol(R1)go + 1-vor(R}) 
— dih(R2)(1 — y2/2to)(@(¥2/2, to) — Go) 
— sol(R2)¢o + r-vor(R2), (10.3) 


Fig. 10.2. An illustration of the terms anc. 
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where R; = R(y;/2, 701, Y2, Yo), No(1/2)), for i = 1, 2. In general, there are Rogers 
simplices on both sides of the face {0, v, v’), and this gives a factor of 2. For example, 
if v has a single anchor v’, then 


c-vor(Vp) < c-voro(Vp) + crown(|v|/2) + 2anc(|v|, |v’|, |v — v'|). 


However, if the anchor gives a face of an upright quarter, only one side of the face lies 
in the V-cell, so that the factor of 2 is not required. For example, v’ has context (2, 1) 
with upright quarter Q, and if there are no other enclosed vertices, and if v’, v” are the 
anchors along the faces of the quarter, then 


c-vor(Vp) < c-voro(Vp) + (1 — dih(Q)/(277)) crown(|v|/2) 


+ anc(|v], |v'], |v — v'|) + anc(|v], |v], |v — v")). 


In general, when there are multiple anchors around the same enclosed vertex v, we add a 
term (2 — k) anc for each anchor, where k € {0, 1, 2} is the number of quarters bounded 
by the face formed by the anchor. We must be cautious (see Condition 2 in Definition 9.2 
in the use of this formula. If the circumradius of {0, v, v’, v”} is less than 79(|v|/2), the 
Rogers simplices used to define the terms anc() at v’ and v” overlap. When this occurs, 
the geometric decomposition on which the correction terms anc() are based is no longer 
valid. In this case, other methods must be used. 

If (P, D) is a mixed quad cluster, let (P, D’) be the new quad cluster obtained by 
removing all the enclosed vertices. We define a V-cell V(P, D’) of (P, D’) and the 
truncation of V(P, D’) at tg. We take its score vorg, p(D’) as we do for standard clusters. 
(P, D’) does not contain any quarters. 


Lemma 10.15. Jf (P, D) is a mixed quad cluster, op(D’) < voro,p(D). 


Remark 10.16. The special case of the proof where an upright quarter has context 
c(Q) = (2,1) will be applied in Section 11.2 in situations other than mixed quad 
clusters. 


Proof. Suppose there exists an enclosed vertex that has context (2, 1); that is, there is a 
single upright quarter Q = S(y,, y2,..., ye) and no additional anchors. In this context 
o(Q) = pu(Q). Let v be the enclosed vertex. To compare op(D) with voro,p(D’), 
consider the V-cell near Q. The quarter Q cuts a wedge of angle dih(Q) from the crown 
at v. There is an anchor term for the two anchors of v along the faces of Q. Let Vp be 
the truncation at height f9 of Vp near v and under the four Rogers simplices stemming 
from the two anchors. (Figure 10.3 shades the truncated parts of the quad cluster.) As a 
consequence 


c-vor(Vp) < (1 —dih(Q)/(2z)) crown(y;/2) + anc(y1, y2, ye) 
+ anc(y1, y3, ¥5) + c-vor(Vp). (10.4) 
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Fig. 10.3. The terms anc near an upright quarter. 


Combining this inequality with calculations,”! we find 
c-vor(Vp) + “(Q) < c-vor(Vp) + s-voro(Q). (10.5) 


Now suppose there is an enclosed vertex v with context (3, 1). Let the quad cluster 
have corners v1, V2, V3, v4, ordered consecutively. Suppose the two quarters along v are 
Q, = {0, v, v1, v2} and Q2 = {0, v, v2, v3}. We consider two cases. 


Case 1: dih(Q,;) + dih(Q2) < z or rad(0, v, v1, v3) > n(lv|, 2, 20). In this case the 
use of correction terms to the crown are legitimate as in Definition 9.2. Proceeding as in 
context (2, 1), we find that 


ce-vor(Vp) < (1 — (dih(Q1) + dih(Q2))/(27)) crown(|v|/2) 
+ anc(F;) + anc(F2) + c-vor(V>). (10.6) 
Here V; is defined by the truncation at height f) under the V-face determined by v and 
under the Rogers simplices stemming from the side of F; that occur in the definition of 


anc. Also, anc(F;) = anc();, yj, yx) for a face F; with edges y,; along an upright quarter. 
By acalculation”” applied to both Q; and Q>, we have 


2 2 
c-vor(Vp) + > a(Q;) < c-vor(Vp) + > s-voro(Q;). (10.7) 
i=l i=l 


That is, by truncating near v, and changing the scoring of the quarters to s-vorg, we 
obtain an upper bound on the score. 


Case 2: dih(Q,) + dih(Q2) > m and rad(0, v, v1, v3) < no(|v|/2). The anchor terms 
cannot be used here. In the mixed case, V8 < |v; — v3], SO 


V2 < 3 vu) — 3] < rad < no(lvl/2), 


21 CALC-906566422, CALC-703457064, and CALC-1755 14843. 
22 CaLc-554253147. 
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and this implies |v| > 2.696. We have”? 


2 2 2 2 
Y*o(Qi) < })s-voro(Qi) + 0.01 (7/2 — dih(Q;)) < } > s-voro(Qi). 


i=l i=1 i=l i=l 


Inequality (10.7) holds, for V3 = Vp. 

In the general case, we run over all enclosed vertices v and truncate around each vertex. 
For each vertex we obtain inequality (10.5) or (10.7). These inequalities can be coherently 
combined over multiple enclosed vertices because the V-faces were associated with 
different vertices v and none of the Rogers simplices used in the terms anc() overlap. More 
precisely, if Z is a set of enclosed vertices, set Vf = inex Vexand vie = Ve ri Ve: 
Coherence means that we obtain valid inequalities by adding the superscript Z to Vp 
and V> in inequalities (10.5) and (10.7), if v ¢ Z. In sum, op(D) < voro,p(D). O 
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This paper is the fourth of six papers giving the Kepler Conjecture proof. It contains 
Chapters 11-14 of the proof. 


An addendum to the proof, which appeared in 2010, requires adding a new sub- 
section to this paper, which would be the following: 


12.14. Biconnected graphs 


The material for this subsection appears as Section 8 of the paper T. C. Hales et al., 
“A Revision of the Proof of the Kepler Conjecture.” This appears in Section II on 
pp. 361-371 of this volume. 


The asterisks added in the margin of some pages indicate that changes are required, 
as listed in the Hales list of errata given on p. 373 of this volume. In addition, on 
p. 224 of this volume, dioop (4, 3) = 0.29426, as given on p. 39 of the 1998 preprint 
T. C. Hales, The Kepler Conjecture, arXiv:math/9811078v2. 
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Abstract. This paper is the fourth ina series of six papers devoted to the proof of the Kepler 
conjecture, which asserts that no packing of congruent balls in three dimensions has density 
greater than the face-centered cubic packing. In a previous paper in this series, a continuous 
function f on acompact space was defined, certain points in the domain were conjectured to 
give the global maxima, and the relation between this conjecture and the Kepler conjecture 
was established. The function f can be expressed as a sum of terms, indexed by regions on 
a unit sphere. In this paper detailed estimates of the terms corresponding to general regions 
are developed. These results form the technical heart of the proof of the Kepler conjecture, 
by giving detailed bounds on the function f. The results rely on long computer calculations. 


Introduction 


This paper contains the technical heart of the proof of the Kepler conjecture. Its primary 
purpose is to obtain good bounds on the score or(D) when R is an arbitrary standard 
region of a decomposition star D. This is particularly challenging, because we have no 
a priori restrictions on the combinatorial type of the standard region R. It is not known 
to be bounded by a simple polygon. It is not known to be simply connected. Moreover, 
there are multitudes of possible geometrical configurations of upright and flat quarters, 
each scored by a different rule. This paper deals with these complexities and bounds the 
score Op(D) in a way that depends on a simple numerical invariant n(R) of R. When R 
is bounded by a simple polygon, the numerical invariant is simply the number of sides 
of the polygon. This bound on the score of a standard region represents the turning point 
of the proof, in the sense that it caps the complexity of a contravening decomposition 
star, and restrains the combinatorial possibilities. Later in the proof it is instrumental in 
the complete enumeration of the plane graphs attached to contravening stars. 

The first section proves a series of approximations for the score of upright quarters. 
The strategy is to limit the number of geometrical configurations of upright quarters by 
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showing that a common upper bound (to the scoring function) can be found for quite 
disparate geometrical configurations of upright quarters. When a general upper bound 
can be found that is independent of the geometrical details of upright quarters, we say 
that the upright quarters can be erased. (A precise definition of what it means to erase an 
upright quarter appears below.) There are some upright quarters that cannot be treated 
in this manner; and this adds some complications to the proofs in this paper. 

The second section states the main result of the paper (Theorem 12.1). An initial 
reduction reduces the proof to the case that the boundary of the given standard region is 
a polygon. A further argument is presented to reduce the proof to a convex polygon. 

The third section completes the proof of the main theorem. This part of the proof 
relies on anew geometrical decomposition of the part of a V-cell over a standard region. 
The pieces in this decomposition are called truncated corner cells. 

A final section in this paper collects miscellaneous further bounds that will be needed 
in later parts of the proof of the Kepler conjecture. 


11. Upright Quarters 
11.1. Erasing Upright Quarters 


Definition 11.1. A standard region is said to be exceptional if it is not a triangle or 
a quadrilateral. The pair (D, R) consisting of a decomposition star and an exceptional 
standard region is said to be an exceptional cluster. The vertices of the packing of height 
at most 2f9 that are contained in the closed cone over the standard region are called its 
corners. 


Fix an exceptional cluster R. Throughout this paper, we assume that R lies on a star 
of score at least 8 pt. It is to be understood, when we say that a standard region does not 
exist, that we mean that there exists no such region on any star scoring more than 8 pt. 

In this section we discuss how to eliminate many cases of upright diagonals. The 
results are summarized in Section 11.9. 

If R is a standard region, we write Vr(t) for the intersection of the local V-cell 
Vr = VC(0O)N C(R) with a ball B(t), centered at the origin, of radius t. We usually take 
t = 1. If {0, v}, of length between 2% and 2/2, is not the diagonal of an upright quarter 
in the Q-system, then v does not affect the truncated cell Vr (to) and may be disregarded. 
For this reason we confine our attention to upright diagonals that lie along an upright 
quarter in the Q-system. 

We say that an upright diagonal {0, v} can be erased with penalty mp => 0, if we have, 
in terms of the decomposition of Section 9, 


Y\o(Q) +} o(Vs(ts)) — 48cct VOl(Sp(v)) < 119 + )\s-voro(Q) + } > s-voro(S). 
Q S O S 
Here the sum over Q runs over the upright quarters around {0, v}. The scores o(Q) are 
context-dependent (see Section 7). The second sum runs over simplices S along {0, v} 
of type C in the S-system. We define their score o (Vs(ts)) as in Section 9. Also, 5p(v) 
is the piece of the decomposition defined in Section 9. The right-hand side is scored by 
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the truncation function in Section 7 (formula (7.13)). When we erase without mention 
of a penalty, 7 = 0 is assumed. 

If the diagonal can be erased, an upper bound on the score is obtained by ignoring the 
upright diagonal and all of the structures around it coming from the decomposition of 
Section 9, and switching to the truncation at fo. The current section shows that various 
vertices can be erased, and this will greatly reduce the number of combinatorial possi- 
bilities for an exceptional cluster. 


11.2. Contexts 


Each upright diagonal has a context (p,q), with p the number of anchors and p — q 
the number of quarters around the diagonal (Definition 7.1). The dihedral angle of a 
quarter is less than‘ zr, so the context (2, 0) is impossible. There is at least one quarter, 
sop>q+1,p=2. 

The context (2, 1) is treated in Section 10.4. Lemma 10.15 shows that by removing 
the upright diagonal, and scoring the surrounding region by a truncated function voro, 
an upper bound on the score is obtained. In the remaining contexts, p > 3. We start 
with contexts satisfying p = 3. The context (3, 0) is to be regarded as two quasi-regular 
tetrahedra sharing a face rather than as three quarters along a diagonal. In particular, by 
Definition 4.8, the upright quarters do not belong to the Q-system. 

We recall that the score of an upright quarter is given by 


o(Q, v) = (u(Q, v) + H(Q, b) + s-voro(Q, v) — s-voro(Q, B))/2, 


except in the contexts (2, 1) and (4, 0). Define v(Q) to be the right-hand side of this 
equation. The context (2, 1) has been treated, and the context (4, 0) does not occur in 
exceptional clusters. Thus, for the remainder of this section, the scoring rule o(Q) = 
v(Q) is used. 

We have several different variants on the score depending on the truncation, analytic 
continuation, and so forth. If f is any of the functions 


S-vory, s-vor, I’, v, 
we set To, Ty, Tr, Ty, respectively, to 
t = —f(S) + sol(S)é pt. 


We set T(S, tf) = — s-vor(S, t) + sol($)¢ pt. The family of functions t,, measure what is 
squandered by a simplex. We say that Q has compression type or Voronoi type according 
to the scoring of w(Q). (See Section 7.1.) 

Crowns and anchor correction terms are used in Section 10.4 to erase upright quarters. 
We imitate those methods here. The functions crown and anc are defined and discussed 
in Section 10.4. If S = S(y1,..., ye) is a simplex along {0, v}, set 


K(S(y1, +--+ Y6)) = crown(y; /2) dih(S)/(27) + anc(y1, y2, Yo) + ane(y1, 3, Ys). 


24 CALC-971555266. 
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«(S) is abound on the difference in the score resulting from truncation around v. Assume 
that S is the simplex formed by {0, v} and two consecutive anchors around {0, v}. Assume 
further that the circumradius of S is at least no(y1/2). Then we have 


K(S) = —4doct Vol(dp(W*)), 


where W° is the extended wedge constructed in Section 9.2. To see this, it is a matter of 
interpreting the terms in «. The function crown enters the volume through the region over 
the spherical cap Do of Section 9.2, lying outside B(to). By multiplying by dih(S)/(277), 
we select the part of the spherical cap over the unextended wedge W between the anchors. 
The terms anc adjust for the four Rogers simplices lying above the extension W°. 


11.3. Three Anchors 
Lemma 11.2. The upright diagonal can be erased in the context (3, 2). 


Proof. Let v; and v2 be the two anchors of the upright diagonal {0, v} along the quarter. 
Let the third anchor be v3. 

Assume first that |v| > 2.696. If Q is of compression type, then”? the score is 
dominated by the truncated function s-vorp. Assume Q is of Voronoi type. If |v], |v2| < 
2.45, then a calculation”® gives the result. Take |v2| > 2.45. By symmetry, |v — v;| or 
|v — v2| > 2.45. The case |v — v| > 2.45 is treated by another calculation.*” We take 
|v — v2| > 2.45. Let S = {0, v, v2, v3}. If S is of type C, the result follows.”® S$ is of type 
C, if and only if y4 < 2.77 (because n456 > (2.45, 2, 2.77) > /2). If S is not of type 
C, we argue as follows. The function h?(n(2h, 2.45, 2.45)~? — no(h)~*) is a quadratic 
polynomial in h? with negative values for 2h € [2.696, 2\/2]. From this we find 


rad($) > n(2h, 2.45, 2.45) > no(h), where 2h = |vl, 
and this justifies the use of « (see Section 9.2, Case (2)). That the truncated function 
dominates the score now follows from a calculation.7° 
Now assume that |v| < 2.696. If the simplices {0, v, v1, v3} and {0, v, v2, v3} are of 
type C, the bound follows from a calculation.*°:*! If say S = {0, v, v2, v3} is not of type 
C, then 
rad($) > J/2> no (2.696/2) > no(h), 


justifying the use of «. The bound follows from further calculations.**-*4 (TC + « < 
octavorg, etc.) 


25 CALC-73974037. 
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Lemma 11.3. The upright diagonal can be erased in the context (3, 1), provided the 
three anchors do not form a flat quarter at the origin. 


Proof. Inthe absence of a flat quarter, truncate, score, and remove the vertex v as in the 
context (3, 1) of Lemma 10.15. If there is a flat quarter, by the rules of Definition 4.8, v 
is enclosed over the flat quarter. We do nothing further with them for now. This unerased 
case appears in the summary, Section 11.9. See Lemma 11.27. 


11.4. Six Anchors 
Lemma 11.4. An upright diagonal has at most five anchors. 


Proof. The proof relies on constants and inequalities from two calculations.*>:*© If 


between two anchors there is a quarter, then the angle is greater than 0.956, but if there 
is not, the angle is greater than 1.23. So if there are k quarters and at least six anchors, 
they squander more than 


k(1.01104) — [22 — (6 — k)1.23]0.78701 > (4 ¢ — 8) pt, 


fork > 0. 


11.5. Anchored Simplices 


Let {0, v} be an upright diagonal, and let v;, v2,..., vy = v; be its anchors, ordered 
cyclically around {0, v}. This cyclic order gives dihedral angles between consecutive 
anchors around the upright diagonal. We define the dihedral angles so that their sum is 
27, even though this will lead us to depart from our usual conventions by assigning a 
dihedral angle greater than 2 when all the anchors are concentrated in some half-space 
bounded by a plane through {0, v}. When the dihedral angle of S = {0, v, v;, v;41} is 
at most 7, we say that S is an anchored simplex if |v; — vj41| < 3.2. (The constant 
3.2 appears throughout this section.) All upright quarters are anchored simplices. If an 
upright diagonal is completely surrounded by anchored simplices, the upright diagonal 
is sometimes called a loop. If |v; — vj41| > 3.2 and the angle is less than 7, we say there 
is a large gap around {0, v} between v; and v;+). 

To understand how anchored simplices overlap we need a bound satisfied by vertices 
enclosed over an anchored simplex. 


Lemma 11.5. A vertex w of height between 2 and 2/2, enclosed in the cone over an 


anchored simplex {0, v, v1, V2} with diagonal {0, v} satisfies |w —v| < 2to. In particular, 
if |w| < 2%, then w is an anchor. 


35 CALC-729988292. 
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Proof. As in Lemma 4.16, the vertex w cannot lie inside the anchored simplex. If 
|v; — vo] < 2/2, the result follows from Lemma 5.16. In fact, if Jw] < o02. the 
Voronoi cells at 0 and w meet, so that Lemma 5.16 forces {0, v1, v2, w} to be a quarter. 
(This observation gives a second proof of Lemma 4.34.) 

Assume that a figure exists with |v; — v2| > 2/2. Suppose for a contradiction that 
|v — w| > 2%. Pivot v; around {0, v2} until |v — vy] = 2f and v2 around {0, v;} until 
|v — vo| = 2t. Rescale w so that |w| = 2/2. Set x = |v, — vy|. If, through geometric 
considerations, w is not deformed into the plane of {0, v2, v;}, then we are left with 


the one-dimensional family |w’| = |w’ — w| = 2, for w’ = vo, vj, |v — w| = |v| = 
|v; — v| = |v2 — v| = 2f9, depending on x. This gives a contradiction 
mw > dih(v2, v1, 0, v) + dih(v2, v1, v, w) 


= 2dih(S(x, 2, 2to, 2%, 2, 2)) > 7, 


for x > 2/2. (Equality is attained if x = 2/2.) 

Thus, we may assume that w lies in the plane P = {0, vj, v2}. Take the circle in P 
at distance 2) from v. The vertices 0 and w lie on or outside the circle. The vertices v; 
and v> lie on the circle, so the diameter is at least x > 2,/2. The distance from v to P is 
less than xp = , [212 — 2. The edge {0, w} cannot pass through the center of the circle, 
because |w| is less than the diameter. Reflect v through P to get v’. Then |v — v’| < 2x9. 
Swapping v,; and v2 as necessary, we may assume that w is enclosed over {0, v, v’, v2}. 
The desired bound |v — w| < 2% now follows from geometric considerations and the 
contradiction 


2/2 = |w| > E(S(2, 2t0, 2tp, 2X0, 2t0, 2t0), 2, 2t, 2to) = 2/2. 


Corollary 11.6. A vertex of height at most 2t9 is never enclosed over an anchored 
simplex. 


Proof. If so, it would be an anchor to the upright diagonal, contrary to the assumption 
that the anchored simplex is formed by consecutive anchors. O 


11.6. Anchored Simplices Do Not Overlap 


Definition 11.7. Consider an upright diagonal that is not a loop. Let R be the standard 
region that contains the upright diagonal and its surrounding quarters. Assume we are 
in the context (4, 1) or (5, 1). In the context (4, 1), suppose that there does not exist a 
plane through the upright diagonal such that all three quarters lie in the same half-space 
bounded by the plane. Then we say that the context is 3-unconfined. If such a plane 
exists, we say that the context is 3-crowded. We call the context (5, 1) a 4-crowded 
upright diagonal. Sections 11.3 and 11.4 reduce everything to contexts with four or five 
anchors around each vertex. If there are five anchors, Lemma 11.14 and Remark 11.13 
show that we can assume at most one large gap. This gives contexts (5,0) and (5, 1). 
If there are four anchors, then Lemma 11.21 will dismiss all contexts except (4, 0) and 
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(4, 1). Thus, every upright diagonal is exactly one of the following: a loop, 3-unconfined, 
3-crowded, or 4-crowded. 


Definition 11.8. The Cayley—Menger determinant expresses the volume of a simplex 


S(yl,..., ye) in the form /A(x,...,%6)/12, where x; = ye, and A is a polynomial ake 


with integer coefficients. The polynomial A will be used frequently. 


This lemma is a consequence of the two others that follow. The context of the lemma 
is the set of anchored simplices that have not been erased by previous reductions. 


Lemma 11.9. Anchored simplices do not overlap. 


The remaining contexts have four or five anchors. Let w and the anchored simplex 
S = {0, v, v1, v2} be as in Section 11.5. Our object is to describe the local geometry 
when an upright diagonal is enclosed over an anchored simplex. If |v; — v2| < 2V2, we 
have seen in Lemma 4.32 that there can be no enclosed upright diagonal with four or 
more anchors over the anchored simplex S. 


Assume |v, — v2| > 2/2. Let W ,..-+, We, k = 4, be the anchors of {0, w}, indexed 
consecutively. The anchors of {0, w} do not lie in C(‘S), and the triangles {0, w, w;} and 
{0, v, vj} do not overlap. Thus, the plane {0, v;, v2} separates w from {w 1, ..., wx}. Set 


S; = {0, w, w;, wi41}. By a calculation*’ 
m > dih(S;) +---+dih(S,_,) > (k — 1)0.956. 


Thus, & = 4. The common upright diagonal of the three simplices {5;} is 3-crowded. 
We claim that {v;, v2} = {w;, wa}. Suppose to the contrary that, after reindexing as nec- 
essary, So = {0, w, w1, v1} is a simplex, with v; 4 wy, that does not overlap S;,..., 53. 
Then z > dih(So) +--- + dih(S3). So 0.28 > 2 — 3(0.956) > dih(So). A calculation” 
now implies that |w — “ail > 2/2. 

Assume that {0, w, v1, v2} are coplanar. Disregard the other vertices. We minimize 
|v; — v2| when 


|w| = 2V2, |v2| = |v] = |w — vo| = 2, |w — v,| = 2V2. 


This implies 3.2 > |v, — v2| > x, where x is the largest positive root of the polynomial 
A(8, 4, 4, x2, 4, 8). However, x ~¥ 3.36, a contradiction. 
Since {0, w, v1, v2} cannot be coplanar vertices, geometric considerations apply and 
2/2 > wl = E080, 2,2, 2,2 3.0), 04/2, 2,2). > 26/2, 


This contradiction establishes that vj = w 1. 


Lemma 11.10. Around a 3-crowded upright diagonal, all of the anchored simplices 
are quarters. 


37 CALC-83777706. 
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Proof. The proof makes use of constants and inequalities from several different calcu- 
lations.°°~*! The dihedral angles are at most 2 — 2(0.956) < 1.23. This forces y4 < 2f0, 
for each simplex S. So they are all quarters. 


Lemma 11.11. Jf there is 3-crowded upright diagonal, then the three anchored sim- 
plices squander more than 0.5606 and score at most —0.4339. 


Proof. The proof makes use of constants and inequalities from several different 
calculations.*74 The three anchored simplices squander at least 


3(1.01104) — 2(0.78701) > 0.5606. 


The bound on the score follows similarly from v < —0.9871 + 0.80449 dih. 


Lemma 11.12. Jf a simplex at a 3-crowded upright diagonal overlaps an anchored 
simplex, the decomposition star does not contravene. 


Proof. Suppose that {0, v, v;, v2} is an anchored simplex that another anchored simplex 
overlaps, with {0, v} the upright diagonal. Let {0, w} be a 3-crowded upright diagonal. 
We score the two simplices Si = {0, v, w, v;} by truncation at /2. Truncation at /2 is 
justified by face-orientation arguments or by geometric considerations: 


ESO. Qin, Diy, 25. ths Dia), 2. 22) = OW 3. 
A calculation* gives 
ty (Si, V2) + ty (85, V2) => 20.13) + 0.2(dih(S{) + dih(S,) — 7) > 0.26. 


Together with the three simplices around the 3-crowded upright diagonal that squander 
at least 0.5606, we obtain the stated bound. 


11.7. Five Anchors 


When there are five anchors of an upright diagonal, each dihedral angle around the 
diagonal is at most 277 — 4(0.956) < z. 


Remark 11.13. There are at most two large gaps by the calculation*® 


3(1.65) + 2(0.956) > 27. 


39 CALC-815492935. 
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Lemma 11.14. /f an upright diagonal has five anchors with two large gaps, then the 
three anchored simplices squander > (4a ¢ — 8) pt. 


Proof. By a calculation,*’ the anchored simplices are all quarters, 1.23 + 2(1.65) + 
2(0.956) > 27r. The dihedral angle is less than 277 — 2(1.65). The linear programming 
bound based on various inequalities** is greater than 0.859 > (47¢ — 8) pt. oO 


Definition 11.15. Define a masked flat quarter to be a flat quarter that is not in the 
Q-system because it overlaps an upright quarter in the Q-system. They can only occur 
in a very special setting. 


Lemma 11.16. Let {0, v} be an upright diagonal with at least four anchors. If Q is a 
flat quarter that overlaps an anchored simplex along {0, v}, then the vertices of Q are 
the origin and three consecutive anchors of {0, v}. 


Proof. For there to be overlap, the diagonal {w;, w2} of Q must pass through the face 
{0, v, v;} formed by some anchor v; (see Lemma 4.19). By Lemma 4.24, w; and wy are 
anchors of {0, v}. By Lemma 4.32, w2, vj, and w, are consecutive anchors. If v; is a 
vertex of Q we are done. Otherwise, let w3 4 0, w;, w2 be the remaining vertex of Q. 
The edges {v, v;} and {v;, 0} do not pass through the face {w1, w2, w3} by Lemma 4.19. 
Likewise, the edges {w2, w3} and {w3, w;} do not pass through the face {0, v, v;}. Thus, 
v is enclosed over the quarter Q. 

Let ws A Wj, V1, W2 be a fourth anchor of {0, v}. By Lemma 4.19, we have 
Ww, = wW3. Oo 


Corollary 11.17 (of the proof). Jf v is enclosed over a flat quarter, then {0, v} has at 
most four anchors. 


When we are unable to erase the upright diagonal with five anchors and a large gap, 
we are able to obtain strong bounds on the score. 


Lemma 11.18. Suppose an upright diagonal in a decomposition star has five anchors 
and one large gap. The four anchored simplices score at most —0.25. The four anchored 
simplices squander at least 0.4. If any of the four anchored simplices is not an upright 
quarter then the decomposition star does not contravene. 


Proof. A list of inequalities? together with”’ dih > 1.65 give the bound —0.25. Further 
inequalities*! give the bound 0.4. To get the final statement of the lemma, we again use 
a series of inequalities.°?->* O 
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Corollary 11.19. There is at most one 4-crowded upright diagonal in a contravening 
decomposition star. 


Proof. The crown along the large gap, with the bound of the lemma, gives** 0.4 —« > 
0.4 + 0.02274 squandered by the upright quarters around a 4-crowded upright diagonal. 
The rest squanders a positive amount (see Lemma 9.20). If there are two 4-crowded 
upright diagonals, use 2(0.4 + 0.02274) > (4¢ — 8) pt. 


Definition 11.20. Weset rp = 0.01561, €y = 0.003521, &, = 0.00935, & = —0.029, 
and & rp = & + &p = —0.01339. 

The first two constants appear in calculations*>’*° as penalties for erasing upright 
quarters of compression type, and Voronoi type, respectively. &. is an improved bound 
on the penalty for erasing when the upright diagonal is at least 2.57. Also, &, is an upper 
bound?’ on «, when the upright diagonal is at most 2.57. If the upright diagonal is at 
least 2.57, then we still obtain the bound>® &, - = —0.02274 + &/ on the sum of « with 
the penalty from erasing an upright quarter. 


11.8. Four Anchors 


Lemma 11.21. /f there are at least two large gaps around an upright diagonal with 
four anchors, then it can be erased. 


Proof. There are at least as many large gaps as upright quarters. Each large gap drops 
us by &, and each quarter lifts us by at most*?~*! &p. We have &  < 0. 


Remark 11.22. Let {0, v} be an enclosed vertex over a flat quarter. Then 
jul > E(2, 2, 2, 2tp, Qty, 2/2, 2, 2, 2) > 2:6. 

If an edge of the flat quarter is sufficiently short, say ye < 2.2, then 
[i] = EC, 2, 2,22, Pty, 20/2, 2,22) > 27. 


The two dihedral angles on the gaps are > 1.65. If the two quarters mask a flat quarter, 
we use the scoring of 2(c) in Section 11.9. We have 0.0114 < —2&, p. 
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When there is one large gap, we may erase with a penalty 77) = 0.008. 


Lemma 11.23. Let v be an upright diagonal with four anchors. Assume that there is 
one large gap. The anchored simplices can be erased with penalty mp = 0.008. If any 
of the anchored simplices around v is not an upright quarter then we can erase with 
penalty my = 0.00222. 

Moreover, if there is a flat quarter overlapping an upright quarter, then one of the 
following holds: 


(1) The truncated function s-vorg exceeds the score by at least 0.0063. The diagonal 
of the flat is at least 2.6, and the edge opposite the diagonal is at least 2.2. 

(2) The truncated function exceeds the score by at least 0.0114. The diagonal of the 
flat is at least 2.7, and the edge opposite the diagonal is at most 2.2. 


Definition 11.24. Let a 3-unconfined upright diagonal be an upright diagonal that 
has four anchors and one large gap in a situation where there is no masked flat 
quarter. 


Proof. The constants and inequalities used in this proof can be found in a series of 
calculations.°?~+ 

First we establish the penalty 0.008. The truncated function s-vorg is an upper bound 
on the score of an anchored simplex that is not a quarter. By these inequalities, the result 
follows if the diagonal satisfies y, > 2.57. 

Take y, < 2.57. If any of the upright quarters are of Voronoi type, the result follows 
from (&.r + & < 0.008). If the edges along the large gap are less than 2.25, the result 
follows from (—0.03883 + 3& = 0.008). If all but one edge along the large gap are less 
than 2.25, the result follows from (—0.0325 + 2&p + 0.00928 = 0.008). 

If there are at least two edges along the large gap of length at least 2.25, we con- 
sider two cases according to whether they lie on a common face of an upright quar- 
ter. The same group of inequalities gives the result. The bound 0.008 is now fully 
established. 

Next we prove that we can erase with penalty 0.00222, when one of the anchored 
simplices is not a quarter. If |v| > 2.57, then we use 


2ép + Ey + & < 0.00935 + 0.003521 — 0.2274 < 0. 


If |v| < 2.57, we use 
2(0.01561) — 0.029 < 0.00222. 
Let v; ..., v4 be the consecutive anchors of the upright diagonal {0, v} with {v;, va} 


the large gap. Suppose |v; — v3| < 2/2. 
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We claim the upright diagonal {0, v} is not enclosed over {0, v1, v2, v3}. Assume the 
contrary. The edge {v;, v3} passes through the face {0, v, v4}. Disregarding the vertex 
v2, by geometric considerations, we arrive at the rigid figure 


|v| = 22, [vi] = |v; — v| = |v — v3] = |v3| = |v3 — v4] = 2, 


|v — vg] = |v4| = 2f0, |v) — v4] = 3.2. 
The dihedral angles of {0, v, v;, v4} and {0, v, v3, v4} are 
dih(S(2V2, 2, 2to, 3.2, 2to, 2)) > 2.3, dih(S(2V2, 2, 2ty, 2, 2to, 2)) > 1.16. 


The sum is greater than zr, contrary to the claim that the edge {v;, v3} passes through the 
face {0, v, v4}. (This particular conclusion leads to the corollary cited at the end of the 
proof.) Thus, {v;, v3} passes through {0, v, v2} so that the simplices {0, v, v1, v2} and 
{0, v, v2, v3} are of Voronoi type. 

To complete the proof of the lemma, we show that when there is a masked flat quarter, 
either (1) or (2) holds. Suppose we mask a flat quarter Q’ = {0, v1, v2, v3}. We have 
established that {v;, v3} passes through the face {0, v, v2}. To establish (1) assume that 
|v2| > 2.2. The remark before the lemma gives 


(vp —w3| = ESC, 2, 2, 2/2, 2ip, 2%), 2, 2, 2) > 2.6. 
The bound 0.0063 comes from 
Erp + 2éy < —0.0063 
To establish (2) assume that |v2| < 2.2. The remark gives 
(vy ws| S ECS, 2, 2, 20/9, 2.2, 2tn), 2,2, 2) = 2.7. 
If the simplex {0, v, v3, v4} is of Voronoi type, then 
& + 3&y < —0.0114. 
Assume that {0, v, v3, v4} is of compression type. We have 
— 0.004131 + & r +é&y < —0.0114. Oo 
Corollary 11.25 (of the proof). Jf there are four anchors and if the upright diagonal 
is enclosed over a flat quarter, then there are four anchored simplices and at least three 
quarters around the upright diagonal. 


11.9. Summary 


The following index summarizes the cases of upright quarters that have been treated 
in this section. If the number of anchors is the number of anchored simplices (no 
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large gaps), the results appear in Section 13.12. Every other possibility has been 


treated. 
e None, one, or two anchors. (Section 11.2.) 
e Three anchors: (Section 11.3.) 


— context (3, 0), 
— context (3, 1), 
— context (3, 2), 
— context (3, 3). 


e Four anchors. (Section 11.8.) 


— No gaps (Section 13.12), 
— One gap, 
— Two or more gaps. 


e Five anchors: (Section 11.7), 


— No gaps (Section 13.12), 
— One gap (4-crowded), 
— Two or more gaps. 


e Six or more anchors. (Section 11.4.) 


By truncation and various comparison lemmas, we have entirely eliminated upright 
diagonals except when there are between three and five anchors. We may assume that 
there is at most one large gap around the upright diagonal. 


1. 


Ww 


Consider an anchored simplex Q around a remaining upright diagonal. The score 
is v(Q) if Q is a quarter, the analytic function s-vor(Q) if the simplex is of type 
C (Section 9.4), and the truncated function s-voro(Q) otherwise. 


. Consider a flat quarter Q in an exceptional cluster. An upper bound on the score 


is obtained by taking the maximum of all of the following functions that satisfy 

the stated conditions on Q. Let y,4 denote the length of the diagonal and y, be the 

length of the opposite edge. 

(a) The function w(Q). 

(b) s-voro(Q) — 0.0063, if y4 > 2.6 and y,; > 2.2. (Lemma 11.23.) 

(c) s-voro(Q) — 0.0114, if y4 > 2.7 and y; < 2.2. (Lemma 11.23.) 

(d) v(Q,) + v(Q2) + s-vor, (S), if there is an enclosed vertex v over Q of height 
between 2f9 and 2/2 that partitions the convex hull of (Q, v) into two upright 
quarters Q;, Q» and a third simplex S. Here s-vor, = s-vor if S is of type C, 
and s-vor, = S-vorg otherwise. (Lemma 11.3.) 

(e) s-vor(Q, 1.385) if the simplex is of type B (Section 9.4.) 

(f) s-voro(Q) if the simplex is an isolated quarter with max(y2, y3) > 2.23, 
y4 = 2.77, and nas5 => V2. 


. If S is a simplex is of type A, its score is s-vor(S). (Section 9.4.) 
. Everything else is scored by the truncation vorg. Formula (7.13) is used on these 


remaining pieces. On top of what is obtained for the standard cluster by summing 
all these terms, there is a penalty 7) = 0.008 each time a 3-unconfined upright 
diagonal is erased. 


. The remaining upright diagonals that are not completely surrounded by anchored 


simplices are 3-unconfined, 3-crowded, or 4-crowded from Sections 11.6-11.8. 
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11.10. Some Flat Quarters 


Recall that &y = 0.003521, > = 0.01561, and &, = 0.00935. They are the penalties 
that result from erasing an upright quarter of Voronoi type, an upright quarter of com- 
pression type, and an upright quarter of compression type with diagonal > 2.57. (See 
calculations.®»%) 

In the next lemma we score a flat quarter by any of the functions on the given domains 


r, N34, Nase < V2, 
S-vor, 1234 = J2, 

6 = 3s-voro, y4 > 2.6, yy > 2.2, 
S-voro, y4 > 2.7, 


s-vorp, Naso > V2. 


Lemma 11.26. G is an upper bound on the functions in 2(a)—-(f) of Section 11.9. That 
is, each function is dominated by some choice of 6. 


Proof. The only case in doubt is the function of 3.10(d): 


v(Q1) + v(Q2) + s-vor; (S). 


This is established by the following lemma. 


We consider the context (3, 1) that occurs when two upright quarters in the Q-system 
lie over a flat quarter. Let {0, v} be the upright diagonal, and assume that {0, v1, v2, v3} 
is the flat quarter, with diagonal {v2, v3}. Let o denote the score of the upright quarters 
and other anchored simplex lying over the flat quarter. 


Lemma 11.27. o < min(0, s-vorg). 


Proof. The bound of 0 is established in Theorem 8.4. 
By acalculation,®’ if |v| > 2.69, then the upright quarters satisfy 


v < S-vorg +0.01 (7/2 — dih), 


so the upright quarters can be erased. Thus we assume without loss of generality that 
|v| < 2.69. 
We have 


|v) = ECS, 2, 2, 2%. Bty, 24/2), 2, 2. 2) > 2.6. 


If |vy — v2| < 2.1, or |v; — v3] < 2.1, then 


[ul = ECS, 2,2, 2.1, Bip, 26/2), 2, 2, 2) = 2.72, 
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contrary to assumption. So take |v; — v2| > 2.1 and |v; — v3| > 2.1. Under these 
conditions we have the interval calculation®® v(Q) < s-vorg(Q) where Q is the upright 
quarter. 


Remark 11.28. If we have an upright diagonal enclosed over a masked flat quarter in 
the context (4, 1), then there are three upright quarters. By the same argument as in the 
lemma, the two quarters over the masked flat quarter score < s-vorg. The third quarter 
can be erased with penalty éy. 


Define the central vertex v of a flat quarter to be the vertex for which {0, v} is the 
edge opposite the diagonal. 


Lemma 11.29. < s-vorg +0.0268 for all flat quarters. If the central vertex has 
height < 2.17, then 4 < s-vorg +0.02. 


9 


Proof. This is an interval calculation.® 


We measure what is squandered by a flat quarter by tT = sol Cpt —<. 


Lemma 11.30. Let v be a corner of an exceptional cluster at which the dihedral angle 
is at most 1.32. Then the vertex v is the central vertex of a flat quarter Q in the exceptional 
region. Moreover, t(Q) > 3.07 pt. If G6 = s-voro (and if nas6 > /2), we may use the 
stronger constant T(Q) > 3.07 pt + &y + 2&h. 


Proof. Let S = S(j1,..., ye) be the simplex inside the exceptional cluster centered at 
v, with y; = |v|. The inequality dih < 1.32 gives the interval calculation yy < 2/2, so 
S is a quarter. The result now follows by interval arithmetic.”° 


12. Bounds in Exceptional Regions 
12.1. The Main Theorem 


Let (R, D) be a standard cluster. Let U be the set of corners, that is, the set of vertices 
in the cone over R that have height at most 2%). Consider the set E of edges of length 
at most 2%) between vertices of U. We attach a multiplicity to each edge. We let the 
multiplicity be 2 when the edge projects radially to the interior of the standard region, 
and 0 when the edge projects radially to the complement of the standard region. The 
other edges, those bounding the standard region, are counted with multiplicity 1. 

Let n; be the number of edges in E,, counted with multiplicities. Let c be the number 
of classes of vertices under the equivalence relation v ~ v’ if there is a sequence of edges 
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in E from v to v’. Letn(R) = n, +2(c— 1). If the standard region under R is a polygon, 
then n(R) is the number of sides. 


Theorem 12.1. Let D be a contravening decomposition star. TR(D) > ty, where n = 
n(R) and 


t4 = 0.1317, ts = 0.27113, t6 = 0.41056, 
t7 = 0.54999, tg = 0.6045. 


The decomposition star scores less than 8 pt, if n(R) = 9, for some standard cluster R. 
The scores satisfy oR(D) < Sn, for 5 <n < 8, where 


s5 = —0.05704, So = —0.11408, s7 = —0.17112, Sg = —0.22816. 


Sometimes, it is convenient to calculate these bounds as a multiple of pt. We have 


t4 > 2.378 pt, ts > 4.896 pt, t6 > 7.414 pt, 
t7 > 9.932 pt, tg > 10.916 pt. 


s5 < —1.03 pt, So < —2.06 pt, s7 < —3.09 pt, Sg < —4.12 pt. 
Corollary 12.2. Every standard region is a either a polygon or one shown in Fig. 12.1. 
In the cases that are not (simple) polygons, we call the polygonal hull the polygon 
obtained by removing the internal edges and vertices. We have m(R) < n(R), where the 
constant m(R) is the number of sides of the polygonal hull. 
Proof. By the theorem, if the standard region is not a polygon, then 8 > n; > m > 5. 
(Quad clusters and quasi-regular tetrahedra have no enclosed vertices. See Lemmas 10.4 


and 5.13.) Ifc > 1, then8 > n =n, +2(c—1) > 5+4+2(c—1),soc = 2, andn; = 5,6 
(frames 2 and 5 of the figure). 


CO 
I) 


Fig. 12.1 
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Now take c = 1. Then8 >n >5+(n—m),son—m < 3. Ifn —m = 3, we get 
frame 3. Ifn —m = 2, we have 8 > m+2>5+2,som =5, 6 (frames | and 4). 

However, n — m = 1| cannot occur, because a single edge that does not bound the 
polygonal hull has even multiplicity. Finally, if n — m = 0, we have a polygon. 


Corollary 12.3. /f the type of a vertex of a decomposition star is (7,0), then it does 
not contravene. 


Proof. By Theorem 12.1, if there is a nontriangular region, we have 
t(D) > tip(7, 0) + t4 > (4g — 8) pt. 


Assume that all standard regions are triangular. If there is a vertex that does not lie on 
one of seven triangles, we have, by Lemma 10.5, 


t(D) > tp(7, 0) + 0.55 pt > (4xe — 8) pt. 


Thus, all vertices lie on one of the seven triangles. The complement of these seven 
triangles is a region triangulation by five standard regions. There is some vertex of these 
five that does not lie on any of the other four standard regions in the complement. That 
vertex has type (3, 0), which is contrary to Lemma 10.9. 


12.2. Nonagons 


A few additional comments are needed to eliminate n = 9 and 10, even after the bounds 
to, tio are established. 


Lemma 12.4. Let F be a set of one or more standard regions bounded by a simple 
polygon with at most nine edges. Assume that 


Orf(D) < 589 and tr(D) = bo, 


where so = —0.1972 and to = 0.6978. Then D does not contravene. 


Proof. Suppose that n = 9, and that R squanders at least fy and scores less than so. 
This bound is already sufficient to conclude that there are no other standard clusters 
except quasi-regular tetrahedra (to + t4 > (4 — 8) pt). There are no vertices of type 
(4, 0) or (6, 0): 9 + 4.14 pt > (42g — 8) pt by Lemma 10.5. So all vertices not over the 
exceptional cluster are of type (5, 0). Suppose that there are ¢ vertices of type (5, 0). The 
polygonal hull of R has m < 9 edges. There are m — 2 + 2¢ quasi-regular tetrahedra. If 
£ < 3, then, by Lemma 10.6, the score is less than 


So + (m — 2+ 20) pt — 0.482 pt < 8 pt. 


If, on the other hand, £ > 4, the decomposition star squanders more than 


ty + 4(0.55) pt > (4c — 8) pt. 
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The bound sy will be established as part of the proof of Theorem 12.1. 

The case n = 10 is similar. If € = 0, the score is less than (m — 2) pt < 8 pt, because 
the score of an exceptional cluster is strictly negative, Theorem 8.4. If 2 > 0, we squander 
at least tio + 0.55 pt > (4¢ — 8) pt (Lemma 10.6). 


12.3. Distinguished Edge Conditions 


Take an exceptional cluster. We prepare the cluster by erasing upright diagonals, includ- 
ing those that are 3-unconfined, 3-crowded, or 4-crowded. The only upright diagonals 
that we leave unerased are loops. When the upright diagonal is erased, we score with 
the truncated function vorg away from flat quarters. Flat quarters are scored with the 
function G. The exceptional clusters in Sections 12 and 13 are assumed to be prepared 
in this way. 

A simplex S is special if the fourth edge has length at least 2\/2 and at most 3.2, and 
the others have length at most 27). The fourth edge is called its diagonal. 

We draw a system of edges between vertices. Each vertex will have height at most 2/9. 
The radial projections of the edges to the unit sphere will divide the standard region into 
subregions. We call an edge nonexternal if the radial projection of the edge lies entirely 
in the (closed) exceptional region. 


1. Draw all nonexternal edges of length at most 2\/2 except those between non- 
consecutive anchors of a remaining upright diagonal. These edges do not cross 
(Lemma 4.30). These edges do not cross the edges of anchored simplices (Lem- 
mas 4.22 and 4.24). 

2. Draw all edges of (remaining) anchored upright simplices that are opposite the 
upright diagonal, except when the edge gives a special simplex. The anchored 
simplices do not overlap (Lemma 11.9), so these edges do not cross. These edges 
are nonexternal (Lemmas 11.5 and 4.19). 

3. Draw as many additional nonexternal edges as possible of length at most 3.2 subject 
to not crossing another edge, not crossing any edge of an anchored simplex, and 
not being the diagonal of a special simplex. 


We fix once and for all a maximal collection of edges subject to these constraints. 
Edges in this collection are called distinguished edges. The radial projection of the 
distinguished edges to the unit sphere gives the bounding edges of regions called the 
subregions. Each standard region is a union of subregions. The vertices of height at most 
2to and the vertices of the remaining upright diagonals are said to form a subcluster. 

By construction, the special simplices and anchored simplices around an upright 
quarter form a subcluster. Flat quarters in the Q-system, flat quarters of an isolated 
pair, and simplices of type A and B are subclusters. Other subclusters are scored by the 
function vorg. For these subclusters, formula (7.13) extends without modification. 


12.4. Scoring Subclusters 


The terms of formula (7.13) defining vorp p(D) = vorp(D, to) have a clear geometric 
interpretation as quoins, wedges of to-cones, and solid angles (see Section 7). There is a 
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quoin for each Rogers simplex. There is a somewhat delicate point that arises in connec- 
tion with the geometry of subclusters. It is not true in general that the Rogers simplices 
entering into the truncation vorg, p(D) of (P, D) lie in the cone over P. Formula (7.13) 
should be viewed as an analytic continuation that has a nice geometric interpretation 
when things are nice, and which always gives the right answer when summed over all 
the subclusters in the cluster, but which may exhibit unusual behavior in general. The 
following lemma shows that the simple geometric interpretation of formula (7.13) is 
valid when the subregion is not triangular. 


Lemma 12.5. Jf a subregion is not a triangle and is not the subregion containing the 
anchored simplices around an upright diagonal, the cone of arcradius 


W = arccos(|v|/(2t0)) 


centered along {0, v}, where v is a corner of the subcluster, does not cross out of the 
subregion. 


Proof. For acontradiction, let {v;, v2} be a distinguished edge that the cone crosses. If 
both edges {v, v;} and {v, v2} have length less than 2f9, there can be no enclosed vertex 
w of height at most 2/9, unless its distance from v, and vz is less than 2/0: 


ES O29, 2, 2.39.9) 8,22) = Dh, 


In this case we can replace {v,, v2} by an edge of the subregion closer to v, so without 
loss of generality we may assume that there are no enclosed vertices when both edges 
{u, vi} and {v, v2} have length less than 2/0. 

The subregion is not a triangle, so |v — v;| > 2f0, or |v — v2| > 2f0, say |v—v1| > 200. 
Also |v — v2| > 2. Pivot so that |v; — v2| = 3.2, |v — vy| = 2%, and |v — v2| = 2. (The 
simplex {0, v;, v2, v} cannot collapse (A #4 0) as we pivot. For more details about why 
A # 0, see inequality (12.2) in Section 12.7.) Then use’! By, < dih3. 


As a consequence, in nonspecial standard regions, the terms in the formula (7.13) for 
vorg retain their interpretations as quoins, Rogers simplices, fo-cones, and solid angles, 
all lying in the cone over the standard region. 


12.5. Proof 


The proof of the theorem occupies the rest of the section. The inequalities for triangular 
and quadrilateral regions have already been proved. The bounds on fz, t4, 53, and s4 are 
found in Lemma 10.1, Section 11.1, Lemma 8.10, and Theorem 8.4, respectively. Thus, 
we may assume throughout the proof that the standard region is exceptional 

We begin with a slightly simplified account of the method of the proof. Set tg = 
0.6978, tig = 0.7891, t, = (4¢ — 8) pt, forn > 11. Set D(n, k) = th4, — 0.06585 k, 
forO <k <n,andn+k > 4. This function satisfies 


D(ny, kj) + D(nz, kx) = D(ny +2 — 2, ky + kz — 2). (12.1) 
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In fact, this inequality unwinds to ¢, + 0.13943 > t,41, D(3, 2) = 0.13943, and t, = 
(0.06585)2 + (n — 4) D(3, 2), forn = 4,5, 6, 7. These hold by inspection. 

Call an edge between two vertices of height at most 2/9 /ong if it has length greater 
than 27). Add the distinguished edges to break the standard regions into subregions. We 
say that a subregion has edge parameters (n, k) if there are n bounding edges, where k 
of them are long. (We count edges with multiplicities as in Section 12.1, if the subregion 
is not a polygon.) Combining two subregions of edge parameters (n,, k,) and (nz, kz) 
along a long edge e gives a union with edge parameters (nm; + m2 — 2,k; + ko — 2), 
where we agree not to count the internal edge e that no longer bounds. Inequality (12.1) 
localizes the main theorem to what is squandered by subclusters. Suppose we break the 
standard cluster into groups of subregions such that if the group has edge parameters 
(n, k), it squanders at least D(n, k). Then by superadditivity (formula (12.1)), the full 
standard cluster R must squander D(n, 0) = t,,n = n(R), giving the result. 

Similarly, define constants s4 = 0, so = —0.1972, s, = 0, forn > 10. Set Z(n, k) = 
Snik — ke, for (n,k) A (3,1), and Z(3, 1) = e, where’? ¢ = 0.00005. The function 
Z(n, k) is subadditive: 


Z(n1, ky) + Z(n2, ky) < Z(m1 + nz — 2, ky + kp — 2). 


In fact, this easily follows from sz + 5, < Sa+p—4, for a,b > 4, and e > 0. It will be 


enough in the proof of Theorem 12.1 to show that the score of a union of subregions 
with edge parameters (n, k) is at most Z(n, k). 


12.6. Preparation of the Standard Cluster 


Fix a standard cluster. We return to the construction of subregions and distinguished 
edges, to describe the penalties. Take the penalty of 0.008 for each 3-unconfined upright 
diagonal. Take the penalty 0.03344 = 3&, + &,.- for 4-crowded upright diagonals. Take 
the penalty 0.04683 = 3ép for 3-crowded upright diagonals. Set max = 0.06688. The 
penalty in the next lemma refers to the combined penalty from erasing all 3-unconfined, 
3-crowded, and 4-crowded upright diagonals in the decomposition star. The upright 
quarters that completely surround an upright diagonal (loops) are not erased. 


Lemma 12.6. The total penalty from a contravening decomposition star is at most 


Tmax + 


Proof. Before any upright quarters are erased, each quarter squanders’* > 0.033, so 
the star squanders > (47 ¢ — 8) prif there are twenty-five or more quarters. Assume there 
are at most twenty-four quarters. If the only penalties are 0.008, we have 8(0.008) < 
Imax. If we have the penalty 0.04683, there are at most seven other quarters (0.5606 + 
8(0.033) > (4mr¢ — 8) pt) (Lemma 11.11), and no other penalties from this type or from 
4-crowded upright diagonals, so the total penalty is at most 2(0.008) + 0.04683 < max. 


7 Compare CALC-193836552. 
® CALC-148776243. 
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Finally, if there is one 4-crowded upright diagonal, there are at most twelve other quarters 
(Section 11.7), and erasing gives the penalty 0.03344 + 4(0.008) < amax. 


The remaining upright diagonals are surrounded by anchored simplices. If the edge 
opposite the diagonal in an anchored simplex has length > 2/2, then there may be an 
adjacent special simplex whose diagonal is that edge. Section 13.12 will give bounds on 
the aggregate of these anchored simplices and special simplices. In all other contexts, 
the upright quarters have been erased with penalties. 

Break the standard cluster into subclusters as in Section 12.3. If the subregion is a 
triangle, we refer to the bounds of Section 13.8. Sections 12.7—13.11 give bounds for 
subregions that are not triangles in which all the upright quarters have been erased. 
We follow the strategy outlined in Section 12.5, although the penalties will add certain 
complications. 

We now assume that we have a subcluster without quarters and whose region is not 
triangular. The truncated function vorg is an upper bound on the score. Penalties are 
largely disregarded until Section 13.4. 

We describe a series of deformations of the subcluster that increase vor, p(D) and 
decrease To, p(D). These deformations disregard the broader geometric context of the 
subcluster. Consequently, we cannot claim that the deformed subcluster exists in any 
decomposition star D. As the deformation progresses, an edge {v1, v2}, not previously 
distinguished, can emerge with the properties of a distinguished edge. If so, we add it 
to the collection of distinguished edges, use it if possible to divide the subcluster into 
smaller subclusters, and continue to deform the smaller pieces. When triangular regions 
are obtained, they are set aside until Section 13.8. 


12.7. Reduction to Polygons 


By deformation we can produce subregions whose boundary is a polygon. Let U be 
the set of vertices over the subregion of height < 2/9. As in Section 12.1, the distin- 
guished edges partition U into equivalence classes. Move the vertices in one equiv- 
alence class U; as a rigid body preserving heights until the class comes sufficiently 
close to form a distinguished edge with another subset. Continue until all the vertices 
are interconnected by paths of distinguished edges. vorg and ty are unchanged by these 
deformations. 

If some vertex v is connected to three or more vertices by distinguished edges, it 
follows from the connectedness of the open subregion that there is more than one con- 
nected component U; (by paths of distinguished edges) of U\{v}. Move U U {v} rigidly 
preserving heights and keeping v fixed until a distinguished edge forms with another 
component. Continue until the distinguished edges break the subregions into subregions 
with polygon boundaries. Again vorg and To are unchanged. 

By the end of Section 12, we will deform all subregions into convex polygons. 


Remark 12.7. We will deform in such a way that the edges {v,, v2} will maintain a 


length of at least 2. The proof that distances of at least 2 are maintained is given in 
Section 12.13. 
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We will deform in such a way that no vertex crosses a boundary of the subregion 
passing from the outside to the inside. 


Edge length constraints prevent a vertex from crossing a boundary of the subregion 
from the inside to the outside. In fact, if v is to cross the edge {v;, v2}, the simplex 
S = {0, v1, v, v2} attains volume 0. We may assume, by the argument of the proof of 
Lemma 12.4, that there are no vertices enclosed over S. Because we are assuming that 
the subregion is not a triangle, we may assume that |v— | > 2f0. We have |v| € [2, 2¢0]. 
If v is to cross {v;, v2}, we may assume that the dihedral angles of S along {0, v;} and 
{O, v2} are acute. Under these constraints, by the explicit formulas of Section 8 of [Ha6], 
the vertex v cannot cross out of the subregion 


A(S) > AQ 4,4, 3.27.4, 22) > 0. (12.2) 


We say that a corner v, is visible from another v2 if {v;, v2} lies over the subregion. A 
deformation may make v; visible from v2, making it a candidate for a new distinguished 
edge. If |vj — v2| < 3.2, then as soon as the deformation brings them into visibility 
(obstructed until then by some v), inequality (12.2) shows that |v; — v|, |v2-—v| < 2t9. So 
U1, UV, Uz are consecutive edges on the polygonal boundary, and |v; — v2| > 2,/4 — i > 
./8. By the distinguished edge conditions for special simplices, {v;, v2} is too long to 
be distinguished. In other words, there can be no potentially distinguished edges hidden 
behind corners. They are always formed in full view. 


12.8. Some Deformations 


Definition 12.8. Consider three consecutive corners v3, Vv), V2 of a subcluster R such 
that the dihedral angle of R at v; is greater than zr. We call such a corner concave. (If 
the angle is less than zr, we call it convex.) Similarly, the angle of a subregion is said to 
be convex or concave depending on whether it is less than or greater than zr. 


Let S = S(y1,..-, Yo) = {0, v1, Vo, v3}, y; = |v; |. Suppose that ye > y5.Letx; = y?. 
Lemma 12.9. Ata concave vertex, 0 voto /0X5 > O and 0T)/dx5 < 0. 


Proof. As xs varies, dih;(S) + dih;(R) is constant fori = 1, 2,3. The part of for- 
mula (7.13) for vorg that depends on x5 can be written 


—B(y1) dih(S) — B(y2) dihg(S) — B(y3) dihg(S) — 4doct(quo(Ri35) + quo( R315), 


where B(y;) = A(yi/2) + Go, Ri3s = R(yi1/2,5, 10), Rais = R(y3/2, 0,10), b = 
n(y1, ¥3, ys), and A(h) = (1 — h/to)(@(h, to.) — Go). Set ui35 = u(x, x3, x5) and 
A; = 0A/0x;. (The notation comes from Section 8 of [Ha6] and Section 7 of this issue.) 
We have 
0 quo(R(a, b, c)) —a(c? — b?)3/? 
ab ~ 3b(b2 — a2)'/2 — 
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and db/dxs > 0. Also, u > 0, A => 0 (see Section 8 of [Ha6]). So it is enough to show 
a : : i 
Vo(S) = uissA'/°—— (B(y1) dih(S) + B(y2) diho(S) + B(y3) dihs(S)) < 0. 
5 


By the explicit formulas of Section 8 of [Ha6], we have 


Vo(S) = —B(y1) yi A6 + B(y2) y2135 — B(y3) y3 Ag. 


For Tt, we replace B with B — pt. It is enough to show that 


ViCS) = —(B(1) — Cpt)yi Ae + (BO) — Spt) y2ui3s — (B(y3) — Spt)y3A4 < 0. 


The lemma now follows from an interval calculation. We note that the polynomials 
V; are linear in x4, and xe, and this may be used to reduce the dimension of the 
calculation. O 


We give a second form of the lemma when the dihedral angle of R is less than zr, that 
is, at a convex corner. 


Lemma 12.10. At a convex corner, 0 voto /0x5 < 0 and 0t9/0x5 > 0, if y1, y2, y3 € 
[2, 2t], A > 0, and (i) ya € [2V2, 3.2], ys, yo € [2, 2to], or (ii) v4 = 3.2, ys, ¥6 € 
[2, 3.2]. 


Proof. We adapt the proof of the previous lemma. Now dih; (S$) — dih;(R) is constant, 
for i = 1, 2,3, so the signs change. vorg depends on x5 through 


Y 5 BOi) dili(S) — 48or(quo(Ri3s) + quo(R315))- 


So it is enough to show that 
1/2 a 
Vo — 480A Hag Ol as) + quo(R315)) < 0. 
5 
Similarly, for to, it is enough to show that 
1/2 a 
Vi — 480A igs 5 quot iss) + quo(R315)) < 0. 
5 
By an interval calculation”* 
) 3 
—4Boeitt35 — (quo( Riss) + quo(R315)) < 0.82, on [2, 2t0]”, 
5 


2.05, on [2,2f]°, ys > 2.189. 


The result now follows from the inequalities.” 


74 CALC-984628285. 
75 CALC-984628285. 
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Return to the situation of concave corner v;. Let v2, v3 be the adjacent corners. By 
increasing x5, the vertex vj moves away from every corner w for which {v1, w} lies 
outside the region. This deformation then satisfies the constraint of Remark 12.7. Stretch 
the shorter of {v,, vo}, {vy, v3} until |v; — vo| = |v; — v3] = 3.07 (or until a new 
distinguished edge forms, etc.). Do this at all concave corners. 

By stopping at 3.07, we prevent a corner crossing an edge from the outside to the 
inside. Let w be a corner that threatens to cross a distinguished edge {v,, v2} as a result 
of the motion at a nonconvex vertex. To say that the crossing of the edge is from the 
outside to the inside implies more precisely that the vertex being moved is an endpoint, 
say v,, of the distinguished edge. At the moment of crossing the simplex {0, v;, v2, w} 
degenerates to a planar arrangement, with the radial projection of w lying over the 
geodesic arc connecting the radial projections of v; and v2. To see that the crossing 
cannot occur, it is enough to note that the volume of a simplex with opposite edges of 
lengths at most 2% and 3.07 and other edges at least 2 cannot be planar. The extreme 
case is 


AQ? 2°, Qty 2.2, 307) > 0, 


If |vy| => 2.2, we can continue the deformations even further. We stretch the shorter 
of {v1, v2} and {v,, v3} until |v; — vo] = |v; — v3| = 3.2 (or until a new distinguished 
edge forms, etc.). Do this at all concave corners v; for which |v;| > 2.2. To see that 
corners cannot cross an edge from the outside to the inside, we argue as in the previous 
paragraph, but replacing 3.07 with 3.2. The extreme case becomes 


AG? On 23 SO. 


12.9. Truncated Corner Cells 


Because of the arguments in Section 12.8, we may assume without loss of generality 
that we are working with a subregion with the following properties. If v is a concave 
vertex and w is not adjacent to v, and yet is visible from v, then |v — w| > 3.2. If v is 
a concave corner, then |v — w| > 3.07 for both adjacent corners w. If v is a concave 
corner and |v| > 2.2, then |v — w| > 3.2 for both adjacent corners w. These hypotheses 
will remain in force through to the end of Section 12. 

Recall from Definition 12.8 that we call a spherical region convex if its interior angles 
are all less than 2. The case where the subregion is a convex triangle will be treated in 
Section 13.8. Hence, we may also assume in Sections 12.9-12.12 that the subregion is 
not a convex triangle. 

We construct a corner cell at each corner. It depends on a parameter A € [1.6, 1.945]. 
In all applications, we take A = 1.945 = 3.2 —f, A = 1.815 = 3.07—%, orA = 1.6= 
3.2/2. 

To construct the cell around the corner v, place a triangle along {0, v} with sides 
|u|, to, 4 (with A opposite the origin). Generate the solid of rotation around the axis 
{0, v}. Extend to a cone over 0. Slice the solid by the perpendicular bisector of {0, v}, 
retaining the part near 0. Intersect the solid with a ball of radius fo. The cones over the 
two boundary edges of the subregion at v make two cuts in the solid. Remove the slice 
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that lies outside the cone over the subcluster. What remains is the corner cell at v with 
parameter 1. 

Corner cells at corners separated by a distance less than 2A may overlap. We define 
a truncation of the corner cell that has the property that the truncated corner cells at 
adjacent corners do not overlap. Let {0, v;, v yt denote the plane perpendicular to the 
plane {0, v;, vj} passing through the origin and the circumcenter of {0, v;, uj}. 

Let v;, v2, v3 be consecutive corners of a subcluster. Take the corner cell with param- 
eter 4 at the corner v2. Slice it by the planes {0, v1, v2}+ and {0, v2, v3}, and retain the 
part along the edge {0, v2}. This is the truncated corner cell (tcc). By construction tcc’s at 
adjacent corners are separated by a plane (0, -, -)+. Tcc’s at nonadjacent corners do not 
overlap if the corners are > 2A apart. Tcc’s will only be used in subregions satisfying this 
condition. It will be shown in Section 12.11 that tcc’s lie in the cone over the subregion 
(for suitable i). 


12.10. Formulas for Truncated Corner Cells 


We will assign a score to tcc’s, in such a way that the score of the subcluster can be 
estimated from the scores of the corner cells. 

We write Co for a tcc. We write Cj for the corresponding untruncated corner cell. 
(Although we call this the untruncated corner cell to distinguish it from the corner cell, 
it is still truncated in the sense that it lies in the ball at the origin of radius fo. It is 
untruncated in the sense that it is not cut by the planes (- - -)+.) 

For any solid body X, we define the geometric truncated function by 


vory(X) = 4(—doct VOl(X) + sol(X) /3) 
the counterpart for squander 
& = 
T (X) = Cptsol(x) — vory(X). 


The solid angle is to be interpreted as the solid angle of the cone formed by all rays 
from the origin through nonzero points of X. We may apply these definitions to obtain 
formulas for vor5(Co), and so forth. 

The formula for the score of a tcc differs slightly according to the convexity of the 
corner. We start with a convex corner v, and let v;, v, and v2 be consecutive corners in 
the subregion. 

Let S = {0, v, v1, v2} be a simplex with |v; — v2| > 3.2. The formula for the score 
of a tcc Co(S) simplifies if the face of Co cut by {0, v, v1}+ does not meet the face cut 
by {0, v, v2}+. We make that assumption in this subsection. Set xo(S) = vorg(Co(S)). 
(The function xo is unrelated to the function x that was introduced in Definition 5.14 to 
measure the orientation of faces.) 


w = arc(yj, to, A), h=y,/2, 
Ring = RO1/2, n126, yi/(2 cos w)), Rix = R(y1/2, 17126; to); 
sol'(y1, y2, ¥o) = +dih(Rjo6)(1 — cos w) — sol(Rix6), 
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Xo(S) = dih(S)(1 — cos w) go 
— sol’(y1, y2, ¥6) 0 — Sol’ (y1, y3, Ys) Po 
+ A(h) dih(S) — 48 oct(quo(Rj26) + quo(R135)). 


In the three lines giving the formula for xo, the first line represents the score of the cone 
before it is cut by the planes {0, v, v;}+ and the perpendicular bisector of {0, v}. The 
second line is the correction resulting from cutting the tcc along the planes {0, v, v;}+. 
The face of the Rogers simplex Rj, lies along the plane {0, v, v:}+. The third line is 
the correction from slicing the tcc with the perpendicular bisector of {0, v}. This last 
term is the same as the term appearing for a similar reason in the formula for voro in 
formula (7.13). In this formula R is the usual Rogers simplex and quo(R;;x) is the quoin 
coming from a Rogers simplex along the face with edges (ijk). 

The formula for the untruncated corner cell is obtained by setting “sol’” and “quo” to 
“O” in the expression for xo. Thus, 


vor®(C) = dih(S)[(1 — cos w)go + A(h)]. 


The formula depends only on A, the dihedral angle, and the height |v|. We write Cj = 
Co (\v|, dih), and suppress A from the notation. The dependence on dih(S) is linear: 


7} (Cg (\v|, dih)) = (dih /z)t5 (Cg (lv|, 7). 


The dependence of xo on the fourth edge yg = |v; — v2| comes through a term 
proportional to dih(S). Since the dihedral angle is monotonic in y4, so is xo. Thus, under 
the assumption that |v; — v2| => 3.2, we obtain an upper bound on xo at y4 = 3.2. Our 
deformations will fix the lengths of the other five variables, and monotonicity gives us 
the sixth. Thus, the tcc’s lead to an upper bound on vor; (and a lower bound on 1,) that 
does not require interval arithmetic. 

At a concave vertex, the formula is similar. Replace “dih(S)” with “(22 — dih(S))” 
in the given expression for xp. We add a superscript minus to the name of the function 
at concave vertices, to denote this modification: x, (Co). 


12.11. Containment of Truncated Corner Cells 
The assumptions made at the beginning of Section 12.9 remain in force. 


Lemma 12.11. Let v be a concave vertex with |v| > 2.2. The tcc at v with parameter 
A = 1.945 lies in the truncated V-cell over R. 


Proof. Consider a corner cell at v and a distinguished edge {v,, v2} forming the bound- 
ary of the subregion. The corner cell with parameter A = 1.945 is contained in a cone of 
arcradius 0 = arc(2, tg, 4) < 1.21 < a/2 (in terms of the function arc of Section 9.7). 
Take two corners w 1, w2, visible from v, between which the given bounding edge ap- 
pears. (We may have w; = v;.) The two visible edges, {v, w;}, have length > 3.2. (Recall 
that the distinguished edges at v have been deformed to length 3.2.) They have arc-length 
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at least arc(2fo, 29, 3.2) > 1.38. The segment of the distinguished edge {v,, v2} visible 
from v has arc-length at most arc(2, 2, 3.2) < 1.86. 

We check that the corner cell cannot cross the visible portion of the edge {v1, v2}. 
Consider the spherical triangle formed by the edges {v, w 1}, {v, w2} (extended as needed) 
and the visible part of {v;, v2}. Let C be the radial projection of v and let AB be the 
radial projection of the visible part of {v;, v2}. Pivot A and B toward C until the edges 
AC and BC have arc-length 1.38. The perpendicular from C to AB has length at least 


arccos(cos(1.38) /cos(1.86/2)) > 1.21 > 6. 


This proves that the corner cell lies in the cone over the subregion. O 


Lemma 12.12. Letv be aconcave vertex. The truncated corner cell at v with parameter 
A = 1.815 lies in the truncated V-cell over R. 


Proof. The proof proceeds along the same lines as the previous lemma, with slightly 
different constants. Replace 1.945 with 1.815, 1.38 with 1.316, and 1.21 with 1.1. Re- 
place 3.2 with 3.07 in contexts giving a lower bound to the length of an edge at v, and 
keep it at 3.2 in contexts calling for an upper bound on the length of a distinguished 
edge. The constant 1.86 remains unchanged. 


Lemma 12.13. The truncated corner cells with parameter 1.6 in a subregion do not 
overlap. 


Proof. We may assume that the corners are not adjacent. If a nonadjacent corner w is 
visible from v, then |w — v| > 3.2, and an interior point intersection p is incompatible 
with the triangle inequality: |p — v| < 1.6, |p — w| < 1.6. If w is not visible, we have 
a chain v = vo, U1,..., Uy = w such that v;+; is visible from v;. Imagine a taut string 
inside the subregion extending from v to w. The radial projections of v; are the corners of 
the string’s path. The string bends in an angle greater than z at each v;, so the angle at each 
intermediate v; is greater than zr. That is, they are concave. Thus, by our deformations 
|v; — vj41| > 3.07. The string has arc-length at least r arc(2fo, 2f9, 3.07) > r(1.316). 
However, the corner cells lie in cones of arcradius arc(2,f,A4) < 1. So 201.0) > 
r(1.316), or r = 1. Thus, w is visible from v. 


Lemma 12.14. The corner cell for X < 1.815 does not overlap the to-cone wedge 
around another corner w. 


Proof. We take 4 = 1.815. As in the previous proof, if there is overlap along a chain, 
then 


arc(2, to, A) + arc(2, to, to) > r arc(2fo, 2to, 3.07), 


and again r = 1. So each of the two vertices in question is visible from the other. 
However, overlap implies |p — v| < 1.815 and |p — w| < %, forcing the contradiction 
|w — v| < 3.07. O 
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Lemma 12.15. The corner cell for ) < 1.945 ata corner v satisfying |v| > 2.2 does 
not overlap the ty-cone wedge around another corner w. 


Proof. We take 4 = 1.945. As in the previous proof, if there is overlap along a chain, 
then 


arc(2, to, A) + arc(2, to, to) > r arc(2fo, 2to, 3.2), 


and again r = |. Then the result follows from 


ljw—vl| <|p—vi|t+l|p—w| < 1.9454 t = 3.2. O 


Definition 12.16. By a penalty-free score, we mean the part of the scoring bound that 
does not include any of the penalty terms. We sometimes call the full score, including 
the penalty terms, the penalty-inclusive score. 


Lemma 12.4 was stated in the context of a subregion before deformation, but a cursory 
inspection of the proof shows that the geometric conditions required for the proof remain 
valid by our deformations. (This assumes that the subregion is not a triangle, which we 
assumed at the beginning of Section 12.9.) In more detail, there is a solid C Pp contained 
in the ball of radius f at the origin, and lying over the cone of the subregion P such that 
a bound on the penalty-free subcluster score is vor (C Po) and squander te (C Po). 

Let {y1,..., y-} be a decomposition of the subregion into disjoint regions whose 
union is X. Then if we let C Po(y;) denote the intersection of C Po(y;) with the cone over 
yj, we can write 


1} (C Po) = >» 15 (C Po(yi)). 


These lemmas allow us to express bounds on the score (and squander) of a subcluster 
as a sum of terms associated with individual (truncated) corner cells. By Lemmas 12.1 1— 
12.15, these objects do not overlap under suitable conditions. Moreover, by the interpre- 
tation of terms provided by Section 12.4, the cones over these objects do not overlap, 
when the objects themselves do not. In other words, under the various conditions, we 
can take the (truncated) corner cells to be among the sets C Po(y;). 

To work a typical example, we place a tcc with parameter 1 = 1.6 at each concave 
corner. We place a to-cone wedge Xo at each convex corner. The cone over each object 
lies in the cone over the subregion. By Lemmas 12.5 and 9.20 (see the proof), the fo-cone 
wedge Xo squanders a positive amount. The part P’ of the subregion outside all tcc’s 
and outside the f9-cone wedges squanders 


sol(P’)(¢pt — go) > 0, 


where sol(P’) is the part of the solid angle of the subregion lying outside the tccs. 
Dropping these positive terms, we obtain a lower bound on the penalty-free squander: 


1) (C Po) = >) 1%) (Co). 
Co 


There is one summand for each concave corner of the subregion. Other cases proceed 
similarly. 
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Lemma 12.17. There are at most two concave corners. 


Proof. Use the parameter A = 1.6 and place a tcc Cp at each concave corner v. Let 
Co (\v|, dih) denote the corresponding untruncated cell. The formula of Section 12.10 
gives 


7) (Co) = t (Co (|u|, dih)) — sol’ (y1, yo, yo)q — SOl’(y1, ¥3, Ys) PO. 


where gj = ¢pt—@ < 0.6671. (The conditions ys > 3.07 and ys > 3.07 force the faces 
along the these edges to have circumradius greater than fo, and this causes the “quo” 
terms in the formula to be zero.) 

By monotonicity in dih, alower bound on ch (Co) is obtained at dih=z. to(Cp (|u|, 77)) 
is an explicit monotone decreasing rational function of |v| € [2, 2f9], which is minimized 
for |v| = 2fo. We find 


To(Co (lvl, dih)) > to(C5 (2to, 1)) > 0.32. 


The term sol’ (y;, y3, ys) is maximized when y3 = 2f9, ys = 3.07, so that sol’ < 0.017. 
(This was checked with interval arithmetic in Mathematica.) Thus, 


T(Co(v)) = 0.32 — 2(0.017) yp > 0.297. 


If there are three or more concave corners, then the penalty-free corner cells squander 
at least 3(0.297). The penalty is at most 2max (Section 12.6). So the penalty-inclusive 
squander is more than 3(0.297) — max > (4r¢ — 8) pt. O 


Lemma 12.18. There are no concave corners of height at most 2.2. 


Proof. Suppose there is a corner of height at most 2.2. Place an untruncated corner 

cell Co(|v|, dih) with parameter A = 1.815 at that corner and a fo-cone wedge at every 
other corner. The subcluster squanders at least To(Co(|v|, 7 )) — max. This is an explicit * 
monotone decreasing rational function of one variable. The penalty-inclusive squander 

is at least 


1(C¥(2to, 1)) — Tmax > (406 — 8) pt. ox 


By the assumptions at the beginning of Section 12.9, the lemma implies that each 
concave corner has distance at least 3.2 from every other visible corner. 

As in the previous lemma, when A = 1.945, a lower bound on what is squandered 
by the corner cell is obtained for |v| = 2f, dih = za. The explicit formulas give 
penalty-free squander > 0.734. Two disjoint corner cells give penalty-inclusive squander 
> (4r¢ — 8) pt. Suppose two at v1, v2 overlap. The lowest bound is obtained when 
|v; — v2| = 3.2, the shortest distance possible. 
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We define a function f (31, y2) that measures what the union of the overlapping corner 
cells squander. Set y; = |v;|, € = 3.2, and 


a, = dih(y1, to, y2,4, €,4), 
a2 = dih(y2, fo, 1,4, £, A), 

sol = sol(y2, fo, ¥1,4, £,A), 

gi = P(yi/2, to), i= 1,2, 
A = 3.2 —t) = 1.945, 


2 
fOr. 2) = 2(Cpt— go) sol +2 Y° aj (1 — yi/(2t0)) (Go — gi) 
1 


2 
+>) w(C(i,4, % — 204)). 
1 


An interval calculation”® gives f(y1, y2) > (4 — 8) pt + max, for y1, yo € [2, 2f0]. 

We conclude that there is at most one concave corner. Let v be such a corner. If we 
push v toward the origin, the solid angle is unchanged and vorpg is increased. Following 
this by the deformation of Section 12.8, we maintain the constraints |v — w| = 3.2, for 
adjacent corners w, while moving v toward the origin. Eventually |v| = 2.2. This is 
impossible by Lemma 12.18. 

We verify that this deformation preserves the constraint |v — w| > 2, for all corners 
w such that {v, w} lies entirely outside the subregion. If fact, every corner is visible from 
v, so that the subregion is star convex at v. We leave the details to the reader. 

We conclude that all subregions can be deformed into convex polygons. 


12.13. Proof that Distances Remain at Least 2 


Remark 12.19. In Section 12.7, to allow for more flexible deformations, we drop all 
constraints on the lengths of (undistinguished) edges {v,, v2} that cross the boundary of 
the subregion. We deform in such a way that the edges {v,, v2} will maintain a length of 
at least 2. 


Recall that we say that a vertex of a subregion is convex if its angle is less than zr, 
and otherwise that is concave (Definition 12.8). In general, if P is a subregion and p; 
and p2 are two vertices of P, there is a minimal curve joining p; and pp inside P. This 
curve is a finite sequence e;, ..., e, of spherical geodesics. We refer to this sequence as 
the sequence of arcs from p; to p2. The endpoint of each spherical arc is a vertex of P. 
All endpoints except possibly p; and p2 are nonconvex. These endpoints are the radial 
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projections of corners of P: vo, v1, ..., Ur+1, With p(vo) = pi and p(v;41) = p2. The 
vertex p, is visible from po if and only ifr = 1. 


Lemma 12.20. This deformation of a subregion at a concave corner v maintains a 
distance of at least 2 to every other corner w. 


Proof. The proof is by contradiction. We may assume that |v — w| < /8. We may 
assume that v and w are the first corners to violate the condition of being at least 2 
apart, so that distances between other pairs of corners are at least 2. A distinguished edge 
connects v and w, if w is visible from v. So assume that w is not visible. Let e(v,, v2) 
be the first distinguished edge crossed by the geodesic arc g from p(v) to p(w). Let po 
be the intersection of e(v;, v2) and g. By construction, the deformation moves v into the 
subregion, and the subregion P is concave at the corner v, so that the arc from p(v) to 
p(w) begins in P, then crosses out at e(v1, v2). 

Geometric considerations show that |v; —v2| > 2.91. In fact, geometric considerations 
show that the shortest possible distance for |v; — v2| under the condition that |v— w| < 2 
is the length of the segment passing through the triangle of sides 2, 2t9, 2% with both 
endpoints at distance exactly 2 from all three vertices of the triangle. This distance is 
greater than 2.91. 

Let e),..., e, be the sequence of arcs from p(v) to p(v,), and let f|,..., f, be the 
sequence of arcs from p(v) to p(v2). Since this sequence forms a minimal curve, the 
sum of the lengths of e; is at most the sum of the lengths of e(v, po) and e(po, v1), and 
the sum of the lengths of f; is at most the sum of the lengths of e(v, po) and e(po, v2). 

Note that ifr +s < 4, then one of the edge-lengths must be at least 3.2, for otherwise 
the sequence of arcs are all distinguished or diagonals of specials, and this would not 
permit the existence of a corner w. That is, we can fully enumerate the corners of the 
subregion, and each projects radially to an endpoint in the sequence of arcs, or is a vertex 
of a special simplex. None of these corners is separated from v by the plane {0, v1, v2}. 

We have r + s < 3 by the following calculations; here y € [2, 2f0]: 


Sarc(2to, 2to, 2) > arc(2, 2, 3.2) + 2arc(2, 2, 2), 

3 arc(2to, 29, 2) + arc(2to, y, 3.2) > arc(y, 2, 3.2) + 2arc(2, 2, 2), 

3 arc(2%, 2t9, 2) + arc(2to, y, 3.2) > arc(2, 2, 3.2) + 2arc(y, 2, 2). 
First we prove the lemma in the special case that the distance from v to one of the 
endpoints, say v1, of {v;, v2} is at least 3.2. In this special case we claim that the constraints 
on the edge-lengths creates an impossible geometric configuration. The constraints are 


as follows. There are five points: 0, v;, w, v, v2. The plane {0, v1, v2} separates point w 
from v. The distance constraints are as follows: 


2 < |u| < 2h, 


foru = vj, W, V, v2, |v—V,| > 3.2, |v—w| < 2, |v—v2| > 2,|w—v1| > 2, |w—v2| > 2, 
and 2 < |v; — v2| < 3.2. 
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If the segment {v, w} passes through the triangle {0, v,, v2}, then the desired impos- 
sibility proof follows by geometric considerations. Again, if the segment {v;, v2} passes 
through the triangle {0, v, w}, then the desired impossibility proof follows by geometric 
considerations, provided that {0, v;, v2, w} are not coplanar. Assume for a contradiction 
that {0, v1, v2, w} lie in the plane P. We move back to the nonplanar case if |v2 — v| is 
not 2 (pivot v2 around {0, w} toward v), if |v; — v| is not 3.2 (pivot v; around {0, w} 
toward v), if |w — v| is not 2 (pivot w around {v,, v2} away from v), or v is not 2% (pivot 
v and w simultaneously preserving |w — v| around {v,, v2}). Therefore, we may assume 
without loss of generality that |v2 — v| = 2, |v; — v| = 3.2, |w — v| = 2, and |v| = 2%. 

Let p be the orthogonal projection of v to the plane P. Leth = |v — p|. The distances 
from ptou € Pis f(|v—u|, h) = V/|v — u|? — h2. We consider two cases depending on 
whether we can find a line in P through p dividing the plane into a half-plane containing 
v1, 0, and v2, or into a half-plane containing v;, w, and v2. In the first case we have 


0 = are(|p — vi|, |pl, |vil) + arc(|p — v9], |pl, |v21) 
— arc(|p — vi|, |p — val, [vi — va) 
> arc(f(3.2,h), f(2t, h), 2) + arc(f (2, h), f (2t, h), 2) 


— arce(f(3.2,h), f(2, h), 3.2). (12.3) 


The function arc is monotonic in the arguments and from this it follows easily that 
this function of h is positive on its domain 0 < h < /3. This is a contradiction. (The 
upper bound /3 is determined by the condition that the triangle {w, vj, v}, which is 
equilateral in the extreme case, exists under the given edge constraints.) In the second 
case, we obtain the related contradiction 


0 = are(|p — vj, |p — w], |v — w]) + are(|p — v2], |p — wl, |v2 — w)) 


= are(|p = 0;|,||p = val. |B1 — V3)) 

arc( f (3.2, h), f (2,h), 2) + are(f (2, h), f (2, A), 2) 

— arc(f (3.2, h), f (2, h), 3.2) 

> 0. (12.4) 


IV 


Now assume that the distances from v to the vertices v; and v2 are at most 3.2. 

Ifr +s = 2, then v; and v2 are visible from v. Thus, they are distinguished or 
diagonals of special simplices. As {v,, v2} is also distinguished, the corners of P are 
fully enumerated: v, vj, v2, and the vertices of special simplices. Since none of these are 
w, we conclude that w does not exist in this case. 

Ifr+s = 3,thensayr = lands = 2. We have {v, v;} is distinguished or the diagonal 
of a special simplex. Let p(v), p(u) be the endpoints of f;, for some corner u. We have 
|u — v,| > V8 because {u, v;} is not distinguished, and max(|u — v|, |u — v;|) > 3.2, 
because otherwise we enumerate all vertices of P as in the case r + s = 2, and find that 
w is not among them. However, now geometric considerations lead to a contradiction: 
there does not exist a configuration of five points 0, u, v, vj, v2, with all distances at 
least 2 satisfying these constraints. (This can be readily solved by geometric considera- 
tions.) 
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13. Convex Polygons 
13.1. Deformations 


We divide the bounding edges over the polygon according to length [2, 2to], [2t%, 2V2], 
(2, 3.2]. The deformations of Section 12.8 contract edges to the lower bound of the 
intervals (2, 2fo, or 2\/2) unless a new distinguished edge is formed. By deforming the 
polygon, we assume that the bounding edges have length 2, 2f, or 2\/2. (There are a 
few instances of triangles or quadrilaterals that do not satisfy the hypotheses needed for 
the deformations. These instances are treated in Sections 13.8 and 13.9.) 


Lemma 13.1. Let S = S(yj,..., yo) be a simplex, with x; = ys as usual. Let y4 > 2, 
A > 0, ys, Yo € {2, 2%0, a2). Fixing all the variables but x,, let f (x,) be one of the 
functions s-vorp(S) or —T)(S). We have f"(x,) > 0 whenever f'(x,) = 0. 


Proof. This is an interval calculation.” 


The lemma implies that f does not have an interior point local maximum for x; € 
[2s 2; Fix three consecutive corners, vo, v1, v2 of the convex polygon, and apply the 
lemma to the variable x; = |v,|? of the simplex S = {0, vo, v1, v2}. We deform the 
simplex, increasing f. If the deformation produces A(S) = 0, then some dihedral angle 
is 7, and the arguments for nonconvex regions bring us eventually back to the convex 
situation. Eventually y; is 2 or 2%9. Applying the lemma at each corner, we may assume 
that the height of every corner is 2 or 2f9. (There are a few cases where the hypotheses 
of the lemma are not met, and these are discussed in Sections 13.8 and 13.9.) 


Lemma 13.2. The convex polygon has at most seven sides. 


Proof. Since the polygon is convex, its perimeter on the unit sphere is at most a great 
circle 27. If there are eight sides, the perimeter is at least 8 arc(2t, 2), 2) > 27. 


13.2. Truncated Corner Cells 


The following lemma justifies using tcc’s at the corners as an upper bound on the score 
(and a lower bound on what is squandered). We fix the truncation parameter at A = 1.6. 


Lemma 13.3. Take a convex subregion that is not a triangle. Assume edges between 
adjacent corners have lengths € {2, 2to, 2/2, 3.2}. Assume nonadjacent corners are 
separated by distances > 3.2. Then the truncated corner cell at each vertex lies in the 
cone over the subregion. 
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Proof. Place atcc at v;. For a contradiction, let {v2, v3} be an edge that the tcc overlaps. 
Assume first that |v; — v;| > 2%, i = 2,3. Pivot so that |vy — v2] = |v, — v3] = 2p. 
Write S(y1,..-, Yo) = {0, v1, v2, v3}. Set w = arc(y1, fo, 1.6). A calculation’® gives 
By (1, Y2; Ye) < dih(S). 

Now assume |v; — v2| < 2f9. By the hypotheses of the lemma, |v; — v2| = 2. If 
|v; —v3| < 3.2, then {0, v1, v2, v3} is triangular, contrary to hypothesis. So |v; —v3| > 3.2. 
Pivot so that |v; — v3| = 3.2. Then” 


By (1, Y2, Yo) < dih2(S), 


where y = arc(y1, fo, 1.6), provided y; € [2.2, 2t0]. Also, if y; € [2.2, 20], then 


arc(y1, fo, 1.6) < arc(y1, y2, yo). 


If yy < 2.2, then A; > 0, so 0 dihy /dx3 < 0. Set x3 = 2t- Also, Ag > 0, so 
8 dihy /Ax4 < 0. Set x4 = 3.27. 

Let c be a point of intersection of the plane {0, v,, v}+ with the circle at distance 
A. = 1.6 from v; on the sphere centered at the origin of radius fo. The angle along {0, v2} 
between the planes {0, v2, vj} and {0, v2, c} is 


dih(R(y2/2, 7126, y1/(2 cos y))). 


This angle is less®° than dihy(S). Also, A, > 0, 0 dihs /0x2 < 0, so set x2 = 242. Then 
As < 0, so dihy > 2/2. This means that {0, v1, v2}t separates the tcc from the edge 
{v2, v3}. 0 


13.3. Analytic Continuation 


In this subsection we assume that 2 = 1.6 and that the tcc under consideration lies at a 
convex vertex. 

Assume that the face cut by {0, v, v; }+ meets the face cut by {0, v, vo}+. Let c; be the 
point on the plane {0, v, v;}+ satisfying |c; — v| = 1.6, |c;| = to. (Pick the root within 
the wedge between v; and v2.) The overlap of the two faces is represented in Fig. 13.1. 

We let co be the point of height fy on the intersection of the planes {0, v, v;}+ and 
{0, v, v2}+. We claim that co lies over the truncated spherical region of the tcc, rather than 
the wedges of fg-cones or the Rogers simplices along the faces {0, v, v;} and {0, v, v2}. 
(This implies that co cannot protrude beyond the corner cell as depicted in the second 
frame of the figure.) To see the claim, consider the tcc as a function of y4 = |v; — v2|. 
When ys is sufficiently large the claim is certainly true. Contract y4 until co = co(ya) 
meets the perpendicular bisector of {0, v}. Then co is equidistant from 0, v, v; and v2 so it 
is the circumcenter of {0, v, v1, v2}. It has distance fg from the origin, so the circumradius 
is tg. This implies that y4 < 2%. 
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Fig. 13.1. Different forms of truncated corner cells are shown. The structure shown in the middle frame 
cannot occur. 


The tcc is defined by the constraints represented in the third frame. The analytic 
continuation of the function x9(S) = x5"(S), defined above, acquires a volume X, 
counted with a negative sign, lying under the spherical triangle (co, c1, cz). Extending 
our notation, we have an analytically defined function xj" and a geometrically defined 
function ree 

xo (S) XG (S) — c-vorp(X), where 


c-vorp(X) = 4(—doct Vol(X) + sol(X)/3) = go sol(X) < 0. 


So xg" > ree and we may always use xo(S) = x§"(S) as an upper bound on the score 
of a tcc. 
For example, with A = 1.6 and S = S$(2.3, 2.3, 2.3, 2.9, 2, 2), we have 


xi"(S) © —0.103981, —_x8(S) & —0.105102, 
or, if S = S(2, 2, 2M, 3.2, 2, 2%), then 


Xo (S) © —0.0718957, x5 (S) © —0.0726143. 


13.4. Penalties 


In Section 12.6 we determined the bound of 2max = 0.06688 on penalties. In this section 
we give a more thorough treatment of penalties. Until now a penalty has been associated 
with a given standard region, but by taking the worst case on each subregion, we can 
move the penalties to the level of subregions. Roughly, each subregion should incur 
the penalties from the upright quarters that were erased along edges of that subregion. 
Each upright quarter of the original standard region is attached at an edge between 
adjacent corners of the standard cluster. The edges have lengths between 2 and 2%. 
The deformations shrink the edges to length 2. We attach the penalty from the upright 
quarter to this edge of this subregion. In general, we divide the penalty evenly among the 
upright quarters along a common diagonal, without trying to determine a more detailed 
accounting. For example, the penalty 0.008 in Lemma 11.23 comes from three upright 
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quarters. Thus, we give each of three edges a penalty of 0.008/3, or, if there are only two 
upright quarters around the 3-unconfined upright diagonal, then each of the two upright 
quarters is assigned the penalty 0.00222/2 (see Lemma 11.23). 

The penalty 0.04683 = 3é; in Section 12.6 comes from three upright quarters around 
a 3-crowded upright diagonal. Each of three edges is assigned a penalty of . The penalty 
0.03344 = 3p + &r comes from a 4-crowded upright diagonal of Section 11.7. It is 
divided among four edges. These are the only upright quarters that take a penalty when 
erased. (The case of two upright quarters over a flat quarter as in Lemma 11.3, are treated 
by a separate argument in Section 13.8. Loops are discussed in Section 13.12.) 

The penalty can be reduced in various situations involving a masked flat quarter. For 
example, around a 3-crowded upright diagonal, if there is a masked flat quarter, two of 
the upright quarters are scored by the analytic function s-vor, so that the penalty plus 
adjustment is only®!-*? 0.034052 = 2éy + ép + 0.0114. The adjustment 0.0114 reflects 
the scoring rules for masked flat quarters (Lemma 11.23). This we divide evenly among 
the three edges that carried the upright quarters. If e is an edge of the subregion R, let 
mto(R, e) denote the penalty and score adjustment along edge e of R. 

In summary, we have the penalties, 


Sas Ey, Ep, 0.008, 


combined in various ways in the upright diagonals that are 3-unconfined, 3-crowded, or 
4-crowded. There are score adjustments 


0.0114 and 0.0063 


from Section 11.9 for masked flat quarters. If the sum of these contributions is s, we set 
mo(R, e) = s/n, for each edge e of R originating from an erased upright quarter of S 


n* 


13.5. Penalties and Bounds 


Recall that the bounds for flat quarters we wish to establish from Section 12.5 are 
Z(3, 1) = 0.00005 and D(3, 1) = 0.06585. Flat quarters arise in two different ways. 
Some flat quarters are present before the deformations begin. They are scored by the 
rules of Section 11.9. Others are formed by the deformations. In this case they are scored 
by vorg. Since the flat quarter is broken away from the subregion as soon as the diagonal 
reaches 2/9. and then is not deformed further, the diagonal is fixed at 2/2. Such flat 
quarters can violate our desired inequalities. For example, 


Z(3, 1) < s-vorg(S(2, 2, 2, 2V2, 2, 2)) ¥ 0.00898, 


T(S(2, 2, 2, 2V2, 2, 2)) © 0.0593. 


On the other hand, as we will see, the adjacent subregion satisfies the inequality by a 
comfortable margin. Therefore, we define a transfer ¢ from flat quarters to the adjacent 
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subregion. (In an exceptional region, the subregion next to a flat quarter along the diagonal 
is not a flat quarter.) 
For a flat quarter Q, set 


__ | 0.0066 (deformation), 

é(Q) = fF (otherwise), 
__ } 0.009 (deformation), 

€0(Q) = (otherwise). 


The nonzero value occurs when the flat quarter Q is obtained by deformation from an 
initial configuration in which Q is not a quarter. The value is zero when the flat quarter 
Q appears already in the undeformed standard cluster. Set 


m(R) = )\m0(R,e) + >, mo(Q,e) +) ex(Q), 
e e Q 

a(R) = )_ mo(R,e) + >) m0(Q,e) + D> &o(Q). 
e e Q 


The first sum runs over the edges of a subregion R. The second sum runs over the edges 
of the flat quarters Q that lie adjacent to R along the diagonal of Q. 

The edges between corners of the polygon have lengths 2, 29, or 2\/2. Let ko, ky, 
and ky be the number of edges of these three lengths, respectively. By Lemma 13.2, we 
have ky + kj + kp < 7. Let o denote any of the functions of (a)—(f) of Section 11.9. Let 
Tt = sol€pt—o. 

To prove Theorem 12.1, refining the strategy proposed in Section 12.5, we must show 
that for each flat quarter Q and each subregion R that is not a flat quarter, we have 


7(Q) > D(3, 1) — €-(Q), 
™(Q) > D(3,1)—e(Q), if y4(Q) = 2V2, 
ty(R) > D(3,2) (type A), 

T(R) > Dlko + ki + ko, ki + ko) +77(R), 


(13.1) 


where D(n, k) is the function defined in Section 12.5. The first of these inequalities 
follows.*?-® In general, we are given a subregion without explicit information about 
what the adjacent subregions are. Similarly, we have discarded all information about 
what upright quarters have been erased. Because of this, we assume the worst, and use 
the largest feasible values of zr. 


Lemma 13.4. We have 1,(R) < 0.04683 + (ko + 2k2 — 3)0.008/3 + 0.006642. 
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Proof. The worst penalty 0.04683 = 3ép per edge comes from a 3-crowded upright 
diagonal. The number of penalized edges not on a simplex around a 3-crowded upright 
diagonal is at most ko + 2k — 3. For every three edges, we might have one 3-unconfined 
upright diagonal. The other cases such as 4-crowded upright diagonals or situations with 
a masked flat quarter are readily seen to give smaller penalties. 


For bounds on the score, the situation is similar. The only penalties we need to consider 
are 0.008 from Lemma 11.23. If either of the other configurations of 3-crowded or 4- 
crowded upright diagonals occurs, then the score of the standard cluster is less than 
Sg = —0.228, by Sections 11.6 and 11.7. This is the desired bound. So it is enough to 
consider subregions that do not have these upright configurations. Moreover, the penalty 
0.008 does not occur in connection with masked flats. So we can take 27, (R) to be 


(ko + 2kz)0.008/3 + 0.009k2. 


Ifky+2k2 < 3, wecan strengthen this to z,(R) = 0.009k2. Let o be any of the functions 
of (a)-(f) of Section 11.9. To prove Theorem 12.1, we will show 


a(Q) < Z(3,1)+&.(Q), 
s-voro(Q) < Z(3,1)+8(Q), if ya(Q) = 2V2, 
vorp(R) < Z(3, 2) (type A), 
voro(R) < Z(ko +k + ko, ki + ko) — 1, (R). 


(13.2) 


The first of these inequalities follows.8°-** 


13.6. Penalties 


Erasing an upright quarter of compression type gives a penalty of at most &r and one of 
Voronoi type gives at most y. We take the worst possible penalty. It is at most n&p in an 
n-gon. If there is a masked flat quarter, the penalty is at most 2&y from the two upright 
quarters along the flat quarter. We note in this connection that both edges of a polygon 
along a flat quarter lie on upright quarters, or neither does. 

If an upright diagonal appears enclosed over a flat quarter, the flat quarter is part of a 
loop with context (n, k) = (4, 1), for a penalty at most 2&} + &y. This is smaller than 
the bound on the penalty obtained from a loop with context (n,k) = (4, 1), when the 
upright diagonal is not enclosed over the flat quarter: 


Ep + 2éy. 


So we calculate the worst-case penalties under the assumption that the upright diagonals 
are not enclosed over flat quarters. 
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A loop of context (n, k) = (4, 1) gives rp + 2&y or 3&r. A loop of context (n, k) = 
(4, 2) gives 2ép or 2éy. 

If we erase a 3-unconfined upright diagonal, there is a penalty of 0.008 (or 0 if it 
masks a flat quarter). This is dominated by the penalty 3, of context (n, k) = (4, 1). 

Suppose we have an octagonal standard region. We claim that a loop does not occur 
in context (n,k) = (4, 2). If there are at most three vertices that are not corners of the 
octagon, then there are at most twelve quasi-regular tetrahedra, and the score is at most 


Sg + 12pt < 8 pt. 


Assume there are more than three vertices that are not corners over the octagon. We 
squander 


tg + Soop (4, 2) + 4tip(5, 0) > (406 _ 8) pt. 


As a consequence, context (n, k) = (4, 2) does not occur. 
So there are at most two upright diagonals and at most six quarters, and the penalty 
is at most 6&. Let f be the number of flat quarters This leads to 


6ér, f =9,1, 
4ép + 2éy, f =2, 
2ép + 4éy, f =3, 
0, f=4. 


p= 


The 0 is justified by a parity argument. Each upright quarter occurs in a pair at each 
masked flat quarter. However, there is an odd number of quarters along the upright 
diagonal, so no penalty at all can occur. 

Suppose we have a heptagonal standard region. Three loops are a geometric im- 
possibility. Assume there are at most two upright diagonals. If there is no context 
(n, k) = (4, 2), then we have the following bounds on the penalty: 


6ér, f =90, 
ee 4&p + 2éy, f=1, 
PF ) 3ér, f =2, 
Ep + 2éy, f=3 


If an upright diagonal has context (n, k) = (4, 2), then 


5ér, f =0, 1, 
Wr = 4 3&r + 2y, f =2, 
Er + 4éy, i 


This gives the bounds used in the diagrams of cases. 


13.7. Constants 


Theorem 12.1 now results from the calculation of a host of constants. Perhaps there 
are simpler ways to do it, but it was a routine matter to run through the long list of 
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constants by computer. What must be checked is that the inequalities (13.1) and (13.2) 
of Section 13.5 hold for all possible convex subregions. Call these inequalities the D and 
Z inequalities. This section describes in detail the constants to check. 

We begin with a subregion given as a convex n-gon, with at least four sides. The 
heights of the corners and the lengths of edges between adjacent edges have been re- 
duced by deformation to a finite number of possibilities (lengths 2, 2t9, or lengths 2, 
2to, 24/2. respectively). By Lemma 13.2, we may take n = 4,5, 6, 7. Not all possible 
assignments of lengths correspond to a geometrically viable configuration. One con- 
straint that eliminates many possibilities, especially heptagons, is that of Section 13.1: 
the perimeter of the convex polygon is at most a great circle. Eliminate all length- 
combinations that do not satisfy this condition. When there is a special simplex it can 
be broken from the subregion and scored*? separately unless the two heights along 
the diagonal are 2. We assume in all that follows that all specials that can be bro- 
ken off have been. There is a second condition related to special simplices. We have 
A(2t, 27,27, x?,2?, 27) < 0, if x > 3.114467. This means that if the cluster edges 
along the polygon are (y1, y2, V3, Ys, Yo) = (20, 2, 2, 2, 2), the simplex must be special 
(ya € [2V2, 3.2). 

The easiest cases to check are those with no special simplices over the polygon. In 
other words, these are subregions for which the distances between nonadjacent corners 
are at least 3.2. In this case we approximate the score (and what is squandered) by 
tcc’s at the corners. We use monotonicity to bring the fourth edge to length 3.2. We 
calculate the tcc constant bounding the score, checking that it is less than the constant 
Z(ko + ky + ko, ki + k2) — 16, from the Z inequalities. The D inequalities are verified 
in the same way. 

When n = 5, 6, 7, and there is one special simplex, the situation is not much more 
difficult. By our deformations, we decrease the lengths of edges 2, 3, 5, 6 of the special 
to 2. We remove the special by cutting along its fourth edge e (the diagonal). We score the 
special with weak bounds.” Along the edge e, we then apply deformations to the (n — 1)- 
gon that remains. If this deformation brings e to length 2/2, then the (n — 1)-gon may 
be scored with tcc’s as in the previous paragraph. However there are other possibilities. 
Before e drops to 2/2, a new distinguished edge of length 3.2 may form between two 
corners (one of the corners will be a chosen endpoint of e). The subregion breaks in 
two. By deformations, we eventually arrive at e = 2\/2 and a subregion with diagonals 
of length at least 3.2. (There is one case that may fail to be deformable to e = 2/2, 
a pentagonal cases discussed further in Section 13.10.) The process terminates because 
the number of sides to the polygon drops at every step. A simple recursive computer 
procedure runs through all possible ways the subregion might break into pieces and 
checks that the tec-bound gives the D and Z inequalities. The same argument works if 
there is a special simplex that overlaps each of the other special simplices in the subcluster. 

When n = 6, 7 and there are two nonoverlapping special simplices, a similar argument 
can be applied. Remove both specials by cutting along the diagonals. Then deform both 
diagonals to length 2,/2, taking into account the possible ways that the subregion can 
break into pieces in the process. In every case the D and Z inequalities are satisfied. 
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There are a number of situations that arise that escape this generic argument and were 
analyzed individually. These include the cases involving more than two special simplices 
over a given subregion, two special simplices over a pentagon, or a special simplex over 
a quadrilateral. Also, the deformation lemmas are insufficient to bring all of the edges 
between adjacent corners to one of the three standard lengths 2, 2f), 2/2 for certain 
triangular and quadrilateral regions. These are treated individually. 

The next few sections describe the cases treated individually. The cases not mentioned 
in the sections that follow fall within the generic procedure just described. 


13.8. Triangles 


With triangular subregions there is no need to use any of the deformation arguments 
because the dimension is already sufficiently small to apply interval arithmetic directly 
to obtain our bounds. There is no need for the tcc-bound approximations. 

Flat quarters and simplices of type A are treated by a computer calculation.°! Other 
simplices are scored by the truncated function s-vorg. We break the edges between corners 
into the cases [2, 2), [2to, aj), and [2V/2, 3.2]. Letko, ki, and ky, withkyp+k,+ky = 3, 
be the number of edges in the respective intervals. 

If ky = 0, we can improve the penalties, 


Tp =, = 0. 


To see this, first we observe that there can be no 3-crowded or 4-crowded upright diag- 
onals. By placing three or more quarters around an upright diagonal, if the subregion 
is triangular, the upright diagonal becomes surrounded by anchored simplices, a case 
deferred until Section 13.12. 

If ko = kj = ky = 1, we can take wr, = &p + 2€y + 0.0114 = 0.034052. A few cases 
are needed to justify this constant. If there are no 3-crowded upright diagonals, zr is at 
most 


[&r + 2€y + &.7]3/4 < 0.0254, 
or 
[E> + 2&y + & 1 ]2/4 + 0.008/3 < 0.0254 


If there are at most two edges in the subregion coming from a 3-crowded upright diagonal, 
then 


(Ep + 2&y + 0.0114)2/3 + 0.008/3 < 0.0254. 


If three edges come from the simplices of a 3-crowded upright diagonal, we get 0.034052. 
To get somewhat sharper bounds, we consider how the edge kz was formed. If it is 
obtained by deformation from an edge in the standard region of length > 3.2, then it 
becomes a distinguished edge when the length drops to 3.2. If the edge in the standard 
region already has length < 3.2, then it is distinguished before the deformation process 
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begins, so that the subregion can be treated in isolation from the other subregions. We 
conclude that when zr! = 0.034052 we can take y4 > 2.6 or ys = 3.2 (Remark 11.22). 
The D and Z inequalities now follow.??:” 


13.9. Quadrilaterals 


We introduce some notation for the heights and edge lengths of a convex polygon. The 
heights will generally be 2 or 29, the edge lengths between consecutive corners will 
generally be 2, 2to, or 2\/2. We represent the edge lengths by a vector 


(a1, bi, a, bo, s++5Qn, bn), 


if the corners of an n-gon, ordered cyclically have heights a; and if the edge length 
between corner i andi + | is b;. We say two vectors are equivalent if they are related 
by a different cyclic ordering on the corners of the polygon, that is, by the action of the 
dihedral group. 

The vector of a polygon with a special simplex is equivalent to one of the form 


Q, 2, a2, 2s 2, whe ). 


If aj = 2to, then what we have is necessarily special (Section 13.7). However, if az = 2, 
it is possible for the edge opposite a2 to have length greater than 3.2. 

Turning to quadrilateral regions, we use tcc scoring if both diagonals are greater than 
3.2. Suppose that both diagonals are between [2/2, 3.2], creating a pair of overlap- 
ping special simplices. The deformation lemma requires a diagonal longer than 3.2, so 
although we can bring the quadrilateral to the form 


(a1, pp 2; 25 2, 2, a4, ba), 
the edges a), a4, by and the diagonal vary** continuously. We have bounds” on the score 


tT > 0.235, vory < —0.075, if bye [2tp, 2/2], 
T > 0.3109, vorg < —0.137, if bye [2/2 321: 


We have D(4, 1) = 0.2052, Z(4, 1) = —0.05705. When by € [2%0, 2/2], we can take 
I, = Mo = 0. (We are excluding loops here.) When by € [2./2, 3.2], we can take 


Ir = Tmax + 0.0066, 
Ie = 0.008(5/3) + 0.009. 


It follows that the D and Z inequalities are satisfied. 
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Suppose that one diagonal has length [2\/2, 3.2] and the other has length at least 3.2. 
The quadrilateral is represented by the vector 


(2, 2, a, 2, 2, bz, ag, ba). 


The hypotheses of the deformation lemma hold, so that a; € {2, 2t)} and b; € {2, 210, 54/3), 
To avoid quad clusters, we assume by > max(b3, 2fo). These are one-dimensional with 
a diagonal of length [2\/2, 3.2] as the parameter. The required verifications”® have been 
made by interval arithmetic. 


13.10. Pentagons 


Some extra comments are needed when there is a special simplex. The general argu- 
ment outlined above removes the special, leaving a quadrilateral. The quadrilateral is 
deformed, bringing the edge that was the diagonal of the special to 2\/2. This section 
discusses how this argument might break down. 

Suppose first that there is a special and that both diagonals on the resulting quadri- 
lateral are at least 3.2. We can deform using either diagonal, keeping both diagonals at 
least 3.2. The argument breaks down if both diagonals drop to 3.2 before the edge of the 
special reaches 2./2 and both diagonals of the quadrilateral lie on specials. When this 
happens, the quadrilateral has the form 


(2, 2, 2, 2, 2, 2, 2, ba), 


where by is the edge originally on the special simplex. If both diagonals are 3.2, this is 
rigid, with by = 3.12. We find its score to be 


s-vorg(S(2, 2, 2, ba, 3.2, 2)) + s-voro(S(2, 2, 2, 3.2, 2, 2)) + 0.0461 < —0.205, 
t(S(2, 2, 2, ba, 3.2, 2)) + t(S(2, 2, 2, 3.2, 2, 2))2 > 0.4645. 


So the D and Z inequalities hold easily. 
If there is a special and there is a diagonal on the resulting quadrilateral < 3.2, we 
have two nonoverlapping specials. It has the form 


(2, 2, do, 2, 2, 2, a4, 2, 2, bs). 


The edges az and ay lie on the special. If bs > 2, cut away one of the special simplices. 
What is left can be reduced to a triangle, or a quadrilateral case and then treated®’ by 
computer. Assume bs = 2. We have a pentagonal standard region. We may assume 
that there is no 3-crowded or 4-crowded upright diagonal, for otherwise Theorem 12.1 
follows trivially from the bounds in Section 9. A pentagon can then have at most a 
3-unconfined upright diagonal for a penalty of 0.008. 
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If dg = 2to or a4 = 2%), we again remove a special simplex and produce triangles, 
quadrilaterals, or the special cases treated by computer.?® We may impose the condition 
dz = a4 = bs = 2. We score this full pentagonal arrangement by computer,” using the 
edge lengths of the two diagonals of the specials as variables. The inequalities follow. 


13.11. Hexagons and Heptagons 


We turn to hexagons. There may be three specials whose diagonals do not cross. Such a 
subcluster is represented by the vector 


(2, 2, a2, 2, 2,2, aq, 2, 2, 2, ag, 2). 


The heights az; are 2 or 2%. Draw the diagonals between corners 1, 3, and 5. This is a 
three-dimensional configuration, determined by the lengths of the three diagonals, which 
is treated by computer! 

There is one case with a special simplex that did not satisfy the generic computer- 
checked inequalities for what is to be squandered. Its vector is 


(a1, 2, 2, 2, 2,2, 2, b4, 2, 2, 2, 2), 


with a; = by = 2t9. A vertex of the special simplex has height a; = 2f9 and all other 
corners have height 2. The subregion is a hexagon with one edge longer than 2. We have 
D(6, 1) = 0.48414. This is certainly obtained if the subregion contains a 3-crowded 
upright diagonal, squandering 0.5606. However, if this configuration does not appear, 
we can decrease zr, to 0.03344 + (2/3)0.008, a constant coming from 4-crowded upright 
diagonals in Section 12.6. With this smaller penalty the inequality is satisfied. 

Now turn to heptagons.The bound 27 on the perimeter of the polygon eliminates all 
but one equivalence class of vectors associated with a polygon that has two or more 
potentially specials simplices. The vector is 


(2, 2, a2, 2, 2; 2, a4, 2, 2, 2; a6, 2, a7, 2), 


ay = a4 = do = a7 = 2%. In other words, the edges between adjacent corners are 2 
and four heights are 29. There are two specials. This case is treated by the procedure 
outlined for subregions with two specials whose diagonals do not cross. 


13.12. Loops 


We now return to a collection of anchored simplices that surround the upright diagonal. 
This is the last case needed to complete the proof of Theorem 12.1. There are four or 
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five anchored simplices around the upright diagonal. There are linear inequalities!0'~!0° 
satisfied by the anchored simplices, broken up according to type: upright, type C, opposite 
edge > 3.2, etc. The anchored simplices are related by the constraint that the sum of 
the dihedral angles around the upright diagonal is 277. We run a linear program in each 
case based on these linear inequalities, subject to this constraint to obtain bounds on the 
score and what is squandered by the anchored simplices. 

When the edge opposite the diagonal of an anchored simplex has length € [2/2, 3.2] 
and the simplex adjacent to the anchored simplex across that edge is a special simplex, 
we use inequalities!°7:!°8 that run parallel to the similar system.'°?:!!° It is not necessary 
to run separate linear programs for these. It is enough to observe that the constants for 
what is squandered improve on those from the similar system!'! and that the constants 
for the score in one system!!? differ with those of the other!!? by no more than 0.009. 

When the dihedral angle of an anchored simplex is greater than 2.46, the simplex 
is dropped, and the remaining anchored simplices are subject to the constraint that 
their dihedral angles sum to at most 27 — 2.46. There cannot be an anchored simplex 
with dihedral angle greater than 2.46 when there are five anchors: 2.46 + 4(0.956) > 
2x. There cannot!!* be two anchored simplices with dihedral angle greater than 2.46: 


2(2.46 + 0.956) > 27. 
The following table summarizes the linear programming results: 


(n, k) Dip, k) D(n, k) Zip (1, k) Z(n,k) 


(4, 0) 0.1362 0.1317 0 0 

(4, 1) 0.208 0.20528 —0.0536 —0.05709 

(4, 2) 0.3992 0.27886 —0.2 —0.11418 

(4, 3) 0.6467 0.35244 —0.424 —0.17127 

(5, 0) 0.3665 0.27113 —0.157 —0.05704 

(5, 1) 0.5941 0.34471 —0.376 —0.11413 
(5, > 2) 0.9706 (Ae — 8) pt * * 


The bound for D(4, 0) comes from Lemma 10.8. A few more comments are needed 
for Z(4, 1). Let S = S(1,..., y6) be the anchored simplex that is not a quarter. If 
yy > 2/2 or dih($) > 2.2, the linear programming bound is < Z(4, 1). With this, if 
y, < 2.75, we have!!> o(S) < Z(4, 1). However, if y, => 2.75, the three upright quarters 
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along the upright diagonal satisfy 
v < —0.3429 + 0.24573 dih. 


With this stronger inequality, the linear programming bound becomes < Z(4, 1). This 
completes the proof of Theorem 12.1. 


Lemma 13.5. Consider an upright diagonal that is a loop. Let R be the standard region 
that contains the upright diagonal and its surrounding simplices. Then the following 
contexts (m,k) are the only ones possible. Moreover, the constants that appear in the 
columns marked o and t are upper and lower bounds respectively for Tr(D) when R 
contains one loop of that context. 


n=n(R) (m,k) oO T 
4 
(4,0) —0.0536 0.1362 
5 
(4, 1) 85 0.27385 
(5,0) —0.157 0.3665 
6 
(4, 1) S6 0.41328 
(4,2) —0.1999 0.5309 
(5,1) —0.37595 0.65995 
7 
(4, 1) S7 0.55271 
(4,2)  —0.25694 0.67033 
8 


(4, 1) Sg 0.60722 
(4,2) —0.31398 0.72484 


Proof. Incontext (m, k), and if nm = n(R), we have 


oR(D) < Sn + ZLp(m, k) — Z(m,k), tr(D) > t + Dip(m, k) — Dim, k). 


The result follows. 


In the context (n, k) = (4, 3), the standard region R must have at least seven sides, 
n(R) > 7. Then 


t(D) = or Soop (4, 3) 
> (47 — 8) pt. 


Thus, we may assume that this context does not occur. 
If the context (5, 1) appears in an octagon, we have 


T(D) > Sloop (5, 1) + tg > (4g — 8) pt. 
If this appears in a heptagon, we have 
T(D) > Stoop (5, 1) + t7 + 0.55 pt > (4¢ — 8) pt, 


because there must be a vertex that is not a corner of the heptagon. It cannot appear on 
a pentagon. 
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14. Further Bounds in Exceptional Regions 
14.1. Small Dihedral Angles 


Recall that Section 12.1 defines an integer n(R) that is equal to the number of sides if the 
region is a polygon. Recall that if the dihedral angle along an edge of a standard cluster 
is at most 1.32, then there is a flat quarter along that edge (Lemma 11.30). 


Lemma 14.1. Let R be an exceptional cluster with a dihedral angle < 1.32 at a vertex 
v. Then R squanders > t, + 1.47 pt, wheren = n(R). 


Proof. In most cases we establish the stronger bound f, + 1.5 pt. In the proof of Theo- 
rem 12.1 we erase all upright diagonals, except those completely surrounded by anchored 
simplices. The contribution to ¢, from the flat quarter Q at v in that proof is D(3, 1) (Sec- 
tion 12.5 and inequalities (13.1)). Note that ¢,(Q) = 0 here because there are no defor- 
mations. If we replace D(3, 1) with 3.07 pt from Lemma 11.30, then we obtain the bound. 
Now suppose the upright diagonal is completely surrounded by anchored simplices. An- 
alyzing the constants of Section 13.12, we see that DLp(m, k) — D(n, k) > 1.5 pt except 
when (n, k) = (4, 1). 

Here we have four anchored simplices around an upright diagonal. Three of them are 
quarters. We erase and take a penalty. Two possibilities arise. If the upright diagonal is 
enclosed over the flat quarter, its height is > 2.6 by geometric considerations and the 
top face of the flat quarter has circumradius at least /2. The penalty is 2éL + éy, so the 
bound holds by the last statement of Lemma 11.30. 

If, on the other hand, the upright diagonal is not enclosed over the flat diagonal, the 
penalty is > + 2&y. In this case we obtain the weaker bound 1.47 pt + f,: 


3.07 pt > DB, 1) + 1.47 pt+ ép + 2&y. 


Remark 14.2. If there are r nonadjacent vertices with dihedral angles < 1.32, we find 
that R squanders t, + r(1.47) pt. 


In fact, in the proof of the lemma, each D(3, 1) is replaced with 3.07 pt from 
Lemma 11.30. The only questionable case occurs when two or more of the vertices 
are anchors of the same upright diagonal (a loop). Referring to Section 13.12, we have 
the following observations about various contexts: 


e (4, 1) can mask only one flat quarter and it is treated in the lemma. 
e (4, 2) can mask only one flat quarter and Dy,p(4, 2) — D(4, 2) > 1.47 pt. 
e (5,0) can mask two flat quarters. Erase the five upright quarters, and take a penalty 
4éy + Er. We get 
D(3, 2) + 2(3.07) pt > ts + 4€y + &p + 21.47) pt. 


e (5, 1) can mask two flat quarters, and DL p(5, 1) — D(5, 1) > 21.47) pt. 
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Fig. 14.1. A 4-circuit. 


14.2. A Particular 4-Circuit 


This subsection bounds the score of a particular 4-circuit on a contravening plane graph. 
The interior of the circuit consists of two faces: a triangle and a pentagon. The circuit and 
its enclosed vertex are show in Fig. 14.1 with vertices marked p, ..., ps. The vertex p; 
is the enclosed vertex, the triangle is (p), p2, ps) and the pentagon is (pj,..., Ps). 

Suppose that D is adecomposition star whose associate graph contains such triangular 
and pentagonal standard regions. Recall that D determines a set U(D) of vertices in 
Euclidean 3-space of distance at most 2%) from the origin, and that each vertex p; can be 
realized geometrically as a point on the unit sphere at the origin, obtained as the radial 
projection of some v; € U(D). 


Lemma 14.3. One of the edges {v,, v3}, {v,, v4} has length less than 2\/2. Both of the 
them have lengths less than 3.02. Also, |v,| > 2.3. 


Proof. This is a standard exercise in geometric considerations as introduced in Sec- 
tion 4.2. (The reader should review that section for the framework of the following 
argument.) We deform the figure using pivots to a configuration v2, ..., vs at height 2, 
and |v; — v;| = 2%, (i, j) = (2, 3), (3, 4), (4, 5), (5, 2). We scale vj until |v; | = 2%. We 
can also take the distance from v; to vs and to v2 to be 2. If we have |v; — v3| > a, 
then we stretch the edge |v; — va| until |v; — v3] = 2/2. The resulting configuration 
is rigid. Pick coordinates to find that |v; — v4| < 2/2. If we have vy — v3| > 260, 
follow a similar procedure to reduce to the rigid configuration |v; — v3| = 2%, to find 
that |v; — v4| < 3.02. The estimate |v;| > 2.3 is similar. 


There are restrictive bounds on the dihedral angles of the simplices {0, v1, v;, v;} 
along the edge {0, v;}. The quasi-regular tetrahedron has a dihedral angle of at most!!° 
1.875. The dihedral angles of the simplices {0, v1, v2, v3}, {0, v1, vs, v4} adjacent to it are 


116 CaLc-984463800. 


226 


Sphere Packings, IV 159 


at most!!7 1.63. The dihedral angle of the remaining simplex {0, v1, v3, v4} is at most!!® 
1.51. This leads to lower bounds as well. The quasi-regular tetrahedron has a dihedral 
angle that is at least 27 — 2(1.63) — 1.51 > 1.51. The dihedral angles adjacent to the 
quasi-regular tetrahedron is at least 27 — 1.63 — 1.51 — 1.875 > 1.26. The remaining 
dihedral angle is at least 27 — 1.875 — 2(1.63) > 1.14. 

A decomposition star D determines a set of vertices U(D) that are of distance at 
most 2f9 from the center of D. Three consecutive vertices p;, p2, and p3 of a standard 
region are determined as the projections to the unit sphere of three corners v,, v2, and 
v3, respectively in U(D). By Lemma 11.30, if the interior angle of the standard region 
is less than 1.32, then |v; — v3| < V8. 


Lemma 14.4. These two standard regions F = {R,, Ro} give tr(D) > 11.16 pt. 


Proof. Let dih denote the dihedral angle of a simplex along a given edge. Let S;; be the 
simplex {0, v;, v;, vj}, for (i, 7) = (2, 3), (3, 4), (4,5), (2, 5). We have ye dih($;;) = 
2x. Suppose one of the edges {v;, v3} or {v1, v4} has length > 2/2. Say {v1, v3}. 

We have!!® 


T (S25) — 0.2529 dih(S25) > —0.3442, 
T (S23) — 0.2529 dih( S23) > —0.1787, 
T (S45) — 0.2529 dih(Sys) > —0.2137, 
T (S34) — 0.2529 dih(S34) > —0.1371. 
We have a penalty &r for erasing, so that 
t(D) > Y) t(Sij) — Sér 
(4) 
> 21 (0.2529) — 0.3442 — 0.1787 — 0.2137 — 0.1371 — 5p 
> 11.16prt, 
where T, = T, T, T] aS appropriate. 
Now suppose {v}, v3} and {v;, v4} have length < 2/2. If there is an upright diagonal 


that is not enclosed over either flat quarter, the penalty is at most 3&p + 2&y. Otherwise, 
the penalty is smaller: 4&}, + &y. We have 


t(D) => (Sj) — Gér + 2év) 


(4) 
> 27 (0.2529) — 0.3442 — 2(0.2137) — 0.1371 — (3&p + 2éy) 


> 11.16 pt. 
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14.3. A Particular 5-Circuit 

Lemma 14.5. Assume that R is a pentagonal standard region with an enclosed vertex 
v of height at most 2tg. Assume further that: 


e |v;| < 2.168 for each of the five corners. 
e Each interior angle of the pentagon is at most 2.89. 
e If v4, v2, v3 are consecutive corners over the pentagonal region, then 


|v; — v2| + |v2 — v3| < 4.804. 


e ba. lu; — vi41| < 11.407. 
Then or(D) < —0.2345 or Tr(D) > 0.6079. 


Proof. Since —0.4339 is less than this the lower bound, a 3-crowded upright diagonal 
does not occur. Similarly, since —0.25 is less than the lower bound, a 4-crowded upright 
diagonal does not occur (Lemma 11.18 and Definition 11.7). 

Suppose that there is a loop in context (n, k) = (4, 2). Again by Lemma 13.5 (with 
n(R) = 7), 


or(D) < —0.2345. 


We conclude that all loops have context (n, k) = (4, 1). 


Case 1. The vertex v = v,2 has distance at least 2to from the five corners of U(D) over 
the pentagon. The penalty to switch the pentagon to a pure vorg score is at most 5ép (see 
Section 12.6). There cannot be two flat quarters because then 


lvj2| > E(S(2, 2, 2, 2tp, 22, 202), 2to, 2ty, 2t) > 2to. 


Case la. Suppose there is one flat quarter, |v; — v4| < 22. There is a lower bound of 
1.2 on the dihedral angles of the simplices {0, v12, v;, vj+1}. This is obtained as follows. 
The proof relies on the convexity of the quadrilateral region. We leave it to the reader to 
verify that the following pivots can be made to preserve convexity. Disregard all vertices 
except U1, V2, U3, U4, Vi2. We give the argument that dih(0, vj2, v1, v4) > 1.2. The others 
are similar. Disregard the length |v; — v4|. We show that 


sd i= dih(O, V12, U1, V2) + dih(0, U12, V2, U3) 
+ dih(O, vj2, v3, v4) < 27 — 1.2. 
Lift v2 so |vj2| = 2%. Maximize sd by taking |v; — v2| = |v2 — v3| = |v3 — v4| = 2%. 
Fixing v3 and v4, pivot v; around {0, v;2} toward v4, dragging v2 toward v12 until |v2 — 


V12| = 2%. Similarly, we obtain |v3—v12| = 2f9. We now have sd < 3(1.63) < 27 —1.2, 
by a calculation. !”° 


120 CaLC-821707685. 
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Return to the original figure and move v1 without increasing |v | until each simplex 
{0, vi2, vj, Vi41} has an edge (vj2, v;) of length 2f9. Interval calculations!?! show that 
the four simplices around vj2 squander 


2m (0.2529) — 3(0.1376) — 0.12 > (4aé — 8) pt+ 5ép. 
Case 1b. Assume there are no flat quarters. By hypothesis, the perimeter satisfies 
Yo lui = visi] < 11.407. 


We have arc(2, 2, x)” = 2x/(16— x’)3/? > 0. The arclength of the perimeter is therefore 
at most 


2 arc(2, 2, 2to) + 2arc(2, 2, 2) + arc(2, 2, 2.387) < 27. 


There is a well-defined interior of the spherical pentagon, a component of area < 277. 
If we deform by decreasing the perimeter, the component of area < 27 does not get 
swapped with the other component. 


Disregard all vertices but vj,..., Us, Viz. If a vertex v; satisfies |v; — vi2| > 260, 
deform v; as in Section 12.8 until |v;_; — v;| = |v; — vj44| = 2, or |v; — v2] = 2%. 
If at any time, four of the edges realize the bound |v; — v;+1| = 2, we have reached an 


impossible situation, because it leads to the contradiction!22 
(5) 
Qn = y-dih < 151-241, 16) = 2x. 


(This inequality relies on the observation, which we leave to the reader, that in any such 
assembly, pivots can by applied to bring |v12 — v;| = 2%9 for at least one edge of each of 
the five simplices.) 

The vertex vj2 may be moved without increasing |v,2| so that eventually by these 
deformations (and reindexing if necessary) we have |v12 — v;| = 2%, i = 1,3, 4. Uf we 
have i = 1, 2, 3, the two dihedral angles along {0, v2} satisfy!?3 < 2(1.51) < z, so the 
deformations can continue.) 

There are two cases. In both cases |v; — vj2| = 2%, fori = 1, 3, 4. 


(i) [v2 — va] = |v12 — v5] = 2%. 
(ii) |vi2 _ v2| = 2to, [v4 = Us | = | Us = v1| =2. 


Case (i) follows from interval calculations!?* 


Y~ 1 = 2m (0.2529) — 5(0.1453) > 0.644 + 7ér. 


In case (ii) we have again 


27 (0.2529) — 5(0.1453). 
121 CaLc-467530297 and CALC- 135427691. 
122 CaLc-115383627 and CALC-603 145528. 


123 CaLc-115383627. 
124 Catc-312132053. 
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In this interval calculation we have assumed that |vj2 — us| < 3.488. Otherwise, setting 
S = (v12, v4, Us, v1), we have 


A(S) < A(3.4882, 4, 4, 8, (2t0), (2t0)”) < 0, 


and the simplex does not exist. (vg — v1 | > 2/2 because there are no flat quarters.) This 
completes Case 1. 


Case 2: The vertex v12 has distance at most 2to from the vertex v, and distance at least 2to 

from the others. Let {0, v13} be the upright diagonal of a loop (4, 1). The vertices of the 
loop are not {v2, v3, v4, Vs} with vj2 enclosed over {0, v2, vs, vi3} by Lemma 11.5. The 
vertices of the loop are not {v2, v3, v4, Us} with vj enclosed over {0, v;, v2, vs} because 
this would lead to a contradiction: 


yi2 = E(S(2, 2, 2, 2to, 2fp, 3.2), 2f0, 2f, 2) > 2tp 
or 
yi2 = E(S(Q2, 2, 2, 2, 2t9, 3.2), 2, 2to, 2) > 2t0. 


We get a contradiction for the same reasons unless {v1, vj2} is an edge of some upright 
quarter of every loop of type (4, 1). 

We consider two cases. (Case 2a) There is a flat quarter along an edge other than 
{v,, Vj2}. That is, the central vertex is v2, v3, V4, Or Us. (Recall that the central vertex of 
a flat quarter is the vertex other than the origin that is not an endpoint of the diagonal.) 
(Case 2b) Every flat quarter has central vertex v1. 

Case 2a. We erase all upright quarters including those in loops, taking penalties as 
required. There cannot be two flat quarters by geometric considerations 


(SC, 2.2 3/2, 2, Bia), Bi Dp, 2) & Dh, 
(S022 Of BA Day) 2, Di, Big) Ss Dm, 


The penalty is at most 7. We show that the region (with upright quarters erased) 
squanders > 7& + 0.644. We assume that the central vertex is v2 (Case 2a(i)) or v3 
(Case 2a(ii)). In Case 2a(i), we have three types of simplices around v12, characterized 
by the bounds on their edge lengths. Let {0, vj2, v1, vs} have type A, {0, v12, vs, v4} and 
{0, v12, v4, v3} have type B, and let {0, vi2, v3, vi} have type C. In Case 2a(i1) there are 
also three types. Let {0, vi2, v1, v2} and {0, vi2, v1, vs} have type A, {0, vi2, vs, v4} type 
B, and {0, vj2, v2, v4} type D. (There is no relation here between these types and the 
types of simplices A, B, and C defined in Section 9.) Upper bounds on the dihedral 
angles along the edge {0, v2} are given as calculations.'*> These upper bounds come as 
a result of a pivot argument similar to that establishing the bound 1.2 in Case la. 

These upper bounds imply the following lower bounds. In Case 2a(i), 


dih > 1.33 (A), 
dih > 1.21 (B), 
dih > 1.63 (C), 


125 CaLC-821707685, CALC-115383627, CALC-576221766, and CALC-122081309. 
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and in Case 2a(ii), 


dih > 1.37. (A), 
dih > 1.25 (B), 
dih > 1.51 (D), 


In every case the dihedral angle is at least 1.21. In Case 2a(i), the inequalities give a 
lower bound on what is squandered by the four simplices around {0, vj2}. Again, we 
move v;2 without decreasing the score until each simplex {0, v12, v;, vi+1} has an edge 
satisfying |vj2 — v;| < 2%. Interval calculations!?6 give 


» T > 27(0.2529) — 0.2391 — 2(0.1376) — 0.266 


(4) 
> 0.808. 


In Case 2a(ii), we have!?’ 


S T > 22(0.2529) — 2(0.2391) — 0.1376 — 0.12 
(4) 
> 0.853. 
So we squander more than 7ér + 0.644, as claimed. 

Case 2b. We now assume that there are no flat quarters with central vertex v2, ..., Us. 
We claim that v, is not enclosed over {0, vj, v2, v3} or {0, vy, vs, v4}. In fact, if vy Is 
enclosed over {0, v1, v2, v3}, then we reach the contradiction!”® 

m < dih(0, vi2, v1, v2) + dih(O, vj2, v2, v3) 
< 1634+1.51 <dz. 


We claim that vj. is not enclosed over {0, v5, vj, v2}. Let S; = {0, vj2, v1, v2} and 
S = {0, vj2, v1, vs}. We have by hypothesis, 


ya(Si1) + ya(S2) = |vi — v2] + |v1 — v5] < 4.804. 
An interval calculation!” gives 


y- dih(s;) < }~ Gih(S;) + 0.5(0.4804/2 — y4(S;))) 
(2) (2) 


< . 


So vj is not enclosed over {0, v1, v2, vs}. 


126 CaLc-644534985, CALC-467530297, and CALC-603910880. 
127 CaLc-135427691. 

128 CaLC-821707685 and CALC-115383627. 

129 CaLc-69064028. 
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Erase all upright quarters, taking penalties as required. Replace all flat quarters with 
s-vorg-scoring taking penalties as required. (Any flat quarter has v as its central vertex.) 
We move v12 keeping |v12| fixed and not decreasing |v;2 — v;|. The only effect this has 
on the score comes through the quoins along {0, v1, vj2}. Stretching |vj2 — v,| shrinks 
the quoins and increases the score. (The sign of the derivative of the quoin with respect 
to the top edge is computed in the proof of Lemma 12.9.) 

If we stretch |v;2 — v;| to length 2%), we are done by Case | and Case 2a. (If defor- 
mations produce a flat quarter, use Case 2a, otherwise use Case 1.) By the claims, we 
can eventually arrange (reindexing if necessary) so that 


(1) |vj2 — v3] = |vy2 — v4l = 2%, or 
(ii) [vig — v3] = |vi2 — v5] = 2¢0. 


We combine this with the deformations of Section 12.8 so that in case (i) we may also 


assume that if |vs — vi2| > 29, then |v4 — vs] = |v5 — v,| = 2 and that if |v2—v12| > 2%, 
then |v; — v2| = |v2 — v3| = 2. In case (ii) we may also assume that if |v4 — vj2| > 260, 
then |v3 — v4| = |v4—v5| = 2 and that if |v2—v42| > 2%, then |v; — v2| = |v2 —v3| = 2. 


Break the pentagon into subregions by cutting along the edges (vj, v;) that satisfy 
|vi2 — v;| < 2to. So for example in case (i), we cut along (v12, v3), (Vi2, U4), (V12, V1), 
and possibly along (v12, v2) and (v12, vs). This breaks the pentagon into triangular and 
quadrilateral regions. 

In case (ii) if |v4 — vj2| > 2%, then the argument used in Case 1 to show that 
|v4 — V12| < 3.488 applies here as well. In case (i) or (ii) if |vjg — v2| > 2%, then for 
similar reasons, we may assume 


A(\u12 — val”, 4, 4, 8, (2t0), |vi2 — vi |?) = 0. 


130 


This justifies the hypotheses for the calculations'~” that we use. We conclude that 


— T > 27 (0.2529) — 3(0.1453) — 2(0.2391) > 0.6749. 


If the penalty is less than 0.067 = 0.6749 — 0.6079, we are done. 

We have ruled out the existence of all loops except (4, 1). Note that a flat quarter with 
central vertex v; gives penalty at most 0.02 by Lemma 11.29. If there is at most one such 
flat quarter and at most one loop, we are done: 


3&r +. 0.02 < 0.067. 


Assume there are two loops of context (n,k) = (4, 1). They both lie along the edge 
{v1, vi2}, which precludes any unmasked flat quarters. If one of the upright diagonals 
has height > 2.696, then the penalty is at most 3&r + 3&y < 0.067. Assume both 
heights are at most 2.696. The total interior angle of the exceptional face at v, is at least 
four times the dihedral angle of one of the flat quarters along {0, v;}, or 4(0.74) by an 
interval calculation.'*! This is contrary to the hypothesis of an interior angle < 2.89. 
se This completes Case 2. This shows that heptagons with pentagonal hulls do not occur. 


130 CaLc-312132053 and CALC-644534985. 
131 Catc-751442360. 


232 


Sphere Packings, IV 165 


Lemma 14.6. Let R be an exceptional standard region. Let V be a set of vertices of 
R. If v € V, let p, be the number of triangular regions at v and let qy be the number of 
quadrilateral regions at v. Assume that V has the following properties: 


. No two vertices in V are adjacent. 

. No two vertices in V lie on a common quadrilateral. 

. Ifv € V, then there are five standard regions at v. 

. Ifv € V, then the corner over v is a central vertex of a flat quarter in the cone 

over R. 

5. Ifv € V, then py > 3. That is, at least three of the five standard regions at v are 
triangular. 

6. If R’ # R is an exceptional region at v, and if R has interior angle at least 1.32 
at v, then R’ also has interior angle at least 1.32 at v. 

7. If (Pv, @v) = (3, 1), then the internal angle at v of the exceptional region is at 

most 1.32. 


Define a: N > R by 


BRWNe 


14.8, n=0, 1,2, 


1.4, n=3, 
= 145 n=4 
0, otherwise. 


Let { F} be the union of {R} with the set of triangular and quadrilateral regions that have 
a vertex at some v € V. Then 


Yo te(D) > Yo (pd (3) + Gud (4) + a(py)) pt 
F 


veV 


Proof. We erase all upright diagonals in the Q-system, except for those that carry a 
penalty: loops, 3-unconfined, 3-crowded, and 4-crowded diagonals. 

We assume that if (p,, gv) = (3, 1), then the internal angle is at most 1.32. Because of 
this, if we score the flat quarter by vorg, then the flat quarter Q satisfies (Lemma 11.30) 


voro(Q) > 3.07 pt > 1.4pt+ D(3, 1) + 2éy + ép. (14.1) 


Every flat quarter that is masked by a remaining upright quarter in the Q-system has 
y4 > 2.6. Moreover, y; > 2.2 or yg > 2.7. Let m7, = 2&y + & if the flat quarter is 
masked, and z, = 0 otherwise. 

We claim that the flat quarter (scored by vorg) together with the triangles and quadri- 
laterals at a given vertex v squander at least 


(Prd (3) + Gud (A) + a(py)) pt + DGB, 1) +m. (14.2) 


If py = 4, this is CALC-314974315. If p, = 3, we may assume by the preceding remarks 
that there are two exceptional regions at v. If the internal angle of R at v is at most 
1.32, then we use inequality (14.1). If the angle is at least 1.32, then by hypothesis, the 
angle R’ at v is at least 1.32. We then appeal to the calculations CALC-675785884 and 
CALC-193592217. 
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To complete the proof of the lemma, it is enough to show that we can erase the upright 
quarters masking a flat quarter at v without incurring a penalty greater than z:,. For then, 
by summing the inequality (14.2) over v, we obtain the result. 

If the upright diagonal is enclosed over the masked flat quarter, then the upright 
quarters can be erased with penalty at most y (by Remark 11.28). Assume the upright 
diagonal is not enclosed over the masked flat quarter. 

If there are at most three upright quarters, the penalty is at most 2) + €p. Assume four 
or more upright quarters. If the upright diagonal is not a loop, then it must be 4-crowded. 
This can be erased with penalty 


2&y + 2 — Kk < 2Ey + Er. 


Finally, assume that the upright quarter is a loop with four or more upright quarters. 
Lemma 13.5 limits the possibilities to parameters (5, 0) or (5, 1). In the case of a loop 
(5, 1), there is no need to erase because |V| < 3 and by Lemma 13.5 the hexagonal 
standard region squanders at least 


te + 3a(py) pt 


as required by the lemma. In the case of a loop (5, 0) in a pentagonal region, if |V| = 1 
then there is no need to erase (again we appeal to Lemma 13.5). If |V| = 2, then the two 
vertices share a penalty of 4&y + &p, with each receiving 


2Ey + ép/2 =< 2éy + ép. 
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Abstract. This paper is the fifth in a series of papers devoted to the proof of the Kepler 
conjecture, which asserts that no packing of congruent balls in three dimensions has density 
greater than the face-centered cubic packing. 

In this paper we prove that decomposition stars associated with the plane graph of 
arrangements we term pentahedral prisms do not contravene. Recall that a contravening 
decomposition star is a potential counterexample to the Kepler conjecture. We use interval 
arithmetic methods to prove particular linear relations on components of any such con- 
travening decomposition star. These relations are then combined to prove that no such 
contravening stars exist. 


Introduction 


Pentahedral prisms come remarkably close to achieving the optimal score of 8 pt, that 
achieved by the decomposition stars of the face-centered cubic lattice packing. In this 
sense we consider pentahedral prisms to be “worst-case” decomposition stars. 

Pentahedral prisms constituted a counterexample to an early version of Hales’s ap- 
proach to a proof of the Kepler conjecture, and have always been a somewhat thorny 
obstacle to the proof of the conjecture. Relations required to treat pentahedral prisms are 
delicate in contrast to the more general bounds which suffice to treat other decomposition 
stars. 


15. Pentahedral Prisms 
Recall that a contravening decomposition star is a potential counterexample to the Kepler 


conjecture. The subject of this paper is a particular class of potentially contravening 
decomposition stars. 
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We use the term pentahedral prisms to refer to this class of potentially contravening 
decomposition stars, and refer to a decomposition star in this class as a pentahedral 
prism. This class is defined by the plane graph in Fig. 15.1. 

An example of an arrangement with such a graph is depicted in Fig. 15.2. 

A pentahedral prism is characterized by the arrangement and combinatorics of its 
standard regions. It is composed of ten triangular standard regions, and five quadrilateral 
standard regions. 

The ten triangles are arranged in two pentahedral caps, five triangles arranged around 
acommon vertex. The five quadrilaterals lie in a band between the two caps. See Fig. 15.3. 

Recall that the standard cluster attached to a triangular standard region is a quasi- 
regular tetrahedron. Likewise, the standard cluster attached to a quadrilateral is a quad 
cluster. We use the term pentahedral cap to refer to both the standard regions and the 
quasi-regular tetrahedrons which comprise it. 


15.1. The Main Theorem 


We begin by recalling various definitions from the second paper, Formulation [HF]. The 
constant pt is introduced in Definition 3.6. Similarly, score is defined in Theorem 3.5, 
as well as Definition 7.8 and Remark 7.20. We denote the score of a region R by o(R). 


Theorem 15.1. Each pentahedral prism P satisfies 
o(P) < (8 — £0) pt 
for & = 10~*. Hence there are no contravening pentahedral prisms. 


The next section introduces a series of propositions which prove the main theorem. 
The first proposition restricts our attention to a set of potentially contravening pentahedral 
prisms. Subsequent propositions provide a collection of relations which we use to prove 
the main theorem. 


15.2. Propositions 


The function sol(-) is introduced in Definition 7.5. The function dih(-) is introduced in 
Definition 4.12. 

We present computations using auxiliary bounds which imply the main result of the 
paper, that the score of any pentahedral prism is strictly less than 8 pt. 

Recall from Section 7.4 that the score decomposition for a decomposition star S$ takes 
the form 


a(S) =) a(R), 


R 


where R runs over the standard clusters in S. 
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Fig. 15.1. The plane graph of a pentahedral prism. 


Fig. 15.2. Spheres in a pentahedral prism arrangement. 


Fig. 15.3. The faces of a pentahedral prism. 


239 


170 S. P. Ferguson 


In the case of a pentahedral prism P, the score o(P) decomposes as 


10 


5 
o(P)= > o(f)) + > 0(Q)) 


i=l j=l 


with the triangular regions 7; numbered so that the two pentahedral caps C; consist of 
{7;: 1 <i <5}, {Tj;: 6 <i < 10} and Q; denotes the quad clusters. Thus 


5 10 
o(C\)= J o(%), (Cr) = ) oT). 
i=1 i=6 


The following proposition gives basic inequalities which we use to form a restricted 
set of pentahedral prisms. 
Proposition 15.2. A pentahedral prism P satisfies the bound 
o(P) < (8 — £0) pt 


for & = 10-8 provided that any one of the following conditions holds: 


1. P contains a tetrahedron T such that 


o(T) < —0.52 pt. 


2. P contains a quad cluster Q such that 


o(Q) < —1.04pt. 


3. P contains a pentahedral cap C such that 


a(C) < 3.48 pt. 


Proof. We use the following scoring bounds proved earlier for any admissible decom- 
position star. 
First, Lemma 8.10 states that any quasi-regular tetrahedron T satisfies 


o(T) < |pt. 
Theorem 8.4 states that any quad cluster Q satisfies 
o(Q) <9. 


Next, a pentahedral cap C consists of five quasi-regular tetrahedra 7; sharing a com- 
mon distinguished edge. At one end of the distinguished edge is the distinguished vertex 
v = 0 which is the center of the decomposition star P. Each 7; has context c; = (5, 0). 
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Lemmas 10.6 (with k = 1 and r = 5) and 10.7 state that any pentahedral cap C; 
satisfies 


3 
o(Ci) = oT, cj, v) < (4.52 — €0) pt 


i=l 
with ¢9 = 107°. 
1. Suppose that some o(T) < —0.52 pt, with T contained in a pentahedral cap C). 


Then the inequalities above give 


5 


—0.52 pt + 4(1 pt) + o (Cz) + ¥+0(Q)) 


j=l 


3.48 pt + (4.52 — £9) pt + 5(0) = (8 — £0) pt. 


Q 
lA 


IA 


2. Suppose that some quad cluster o(Q;) < —1.04 pt. Then 


a(P) < a(C\) +a(C2) + (—1.04 pt) + 4(0) 
2((4.52 — €0) pt) — (1.04 pt) = (8 — 2e0) pt. 


IA 


3. Suppose that some pentahedral cap C; has o(C;) < 3.48 pt. Then the inequalities 
above give 


o(P) < (3.48 pt) + (4.52 — €9) pt + 5(0) = (8 — €0) pt. 


This completes the proof. O 


Definition 15.3. A PC pentahedral prism is a pentahedral prism such that: 


1. All tetrahedra T have o(T) > —0.52 pt. 
2. All quad clusters have o(Q) > 1.04 pt. 
3. All pentahedral caps have o(C) > 3.48 pt. 


The configuration arises as a pointwise limit of configurations in which (1)-(3) hold with 
strict inequality. A strict PC pentahedral prism is one in which (1)-(3) each hold with 
strict inequality. 


All remaining propositions apply to PC pentahedral prisms. This restriction improves 
the quality of the bounds which we are able to prove on components of a pentahedral 
prism. 

The following two propositions contain linear relations which imply the main theorem. 
We defer their proofs to the next section. 


Proposition 15.4. For a quasi-regular tetrahedron T in a PC pentahedral prism, the 


following linear inequality holds between o (T), the spherical angle sol(T ) (at the central 
vertex common to the five tetrahedra in the pentahedral cap), and the dihedral angle 
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dih(T) associated with the first edge of the tetrahedron (that is, the edge common to the 
five tetrahedra in a pentahedral cap): 


2 
o(T)+msol(T) +a (ancy = =) ah 20. 
Here a = 0.0739626, b. = 0.253095, and m = 0.3621. 


Proposition 15.5. For a quad cluster Q in a PC pentahedral prism, the following 
linear inequality holds between o (Q) and the spherical angle sol(Q): 


o(Q)+msol(Q) — by < 0. 


Here by = 0.49246 and again m = 0.3621. 


From Propositions 15.4 and 15.5 we can deduce the following theorem. 


Theorem 15.6. Each PC pentahedral prism P satisfies the score bound 


o(P) < 7.9997 pt. 


Proof. Propositions 15.4 and 15.5 provide linear relations on all of the standard clusters 
in a PC pentahedral prism P. We combine these relations to prove the required score 
bound. 

Invoking Proposition 15.4 for the five quasi-regular tetrahedrons {7}: i = 1...5} 
from a pentahedral cap, we find 


5 


B) 5 
Yio) +m) soll) +a )~ (ain = =) — 5b, < 0. 


i=1 i=l i=1 


Summing over both pentahedral caps and using the relation that the sum of the five 
dihedral angles in a pentahedral cap is 277, 


5 
> dih(7;) = 27, 
i=l 
we find 
10 10 


yi o(N) +m Y= sol(7;) — 10b, < 0. 


i=l i=l 
We represent the tetrahedra from the second pentahedral cap by the indices i = 6... 10. 


Invoking Proposition 15.5 for the five quad clusters {Q;: i = 11... 15}, and using 
the fact that the sum of the solid angles is 477, 


10 15 


> sol(Q;) + > sol(Q;) = 47, 
i=1 


j=l 
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we find 


10 15 
Yo(T;) + >\ o(Q;) + 4m — 5b — 10b, < 0. 
i=l j=ll 

Therefore, 


o(P) <5b+ 10b, — 42m. 


Substituting the values of b, b.,m, and pt, we find that the score of a PC pentahedral 


prism is less than 7.9997 pt. 


Assuming Proposition 15.2 and Theorem 15.6 we can prove Theorem 15.1. 


Proof of Theorem 15.1. Given a pentahedral prism P, it is either PC or it is not. In the 
former case its score is bounded by 7.9997 pt. In the latter case its score is bounded by 


(8 — 107%) pt. In both cases its score is bounded by (8 — 107%) pt. 


Remark 15.7. The score bound in Theorem 15.1 is weaker than what is possible to 
prove. In the interest of simplifying the exposition as well as the required computations, 
we establish this weaker bound which suffices for this part of the proof of the Kepler 
conjecture. 


16. The Main Propositions 


In the first section we recall the definition of score, and introduce some local notation. 
In the next section we recall the notion of dimension reduction, and prove its validity for 
some relevant cases. In the following section we prove Proposition 15.4. In the remaining 
sections we prove Proposition 15.5. 


16.1. Scoring 


The development of a scoring function is central to the proof of the Kepler conjecture. 
Its definition is therefore somewhat complicated. Fortunately, in our treatment of the 
pentahedral prism we are able to restrict our attention to only a few cases in the scoring 
system. 

Recall that score is defined in Theorem 3.5, as well as Definitions 7.6 and 7.8 and 
Remark 7.20. See Remark 7.23 for a simplified version of the scoring function for 
quarters. 

In our context, the score o(-) breaks into four different scoring types: gma(-), vor(-), 
octavor(-), and Voronoi. 

gma(-) applies to quasi-regular tetrahedrons and quarters, and is introduced as I"(-) 
in Definition 7.6. We frequently use the term compression as an alias for gma(-). This 
alias was introduced in Section 7.6. 
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vor(-) is the score determined by the analytic continuation of the Voronoi volume 
associated with the distinguished vertex of a tetrahedron, and corresponds to s-vor(-) in 
Definition 7.6. 

We let octavor(-) denote the score of an upright quarter in context (4, 0) which is not 
scored by compression. In this case, octavor(-) is the average of two vor(-) scores. 

Voronoi scoring, which we also refer to as pure Voronoi scoring, is vorr(D) from 
Remark 7.20. 


16.2. Dimension Reduction 


The relations on tetrahedra required for the scoring bound on decomposition stars are 
typically six-dimensional, as they are formulated in terms of the edge lengths of a tetra- 
hedron. For a quad cluster, they can be even higher-dimensional. For high-dimensional 
relations, the method of subdivision becomes very expensive, computationally speaking. 

We define a simplification which reduces the dimension of the required computations. 
This simplification therefore reduces the computational expense of the verification of a 
relation. 

We refer to this simplification as dimension-reduction. We apply this simplification 
for three different scoring types: compression, vor analytic, and Voronoi. These scoring 
functions are introduced in Definitions 7.6 and 7.8. See Remark 7.23 for a simplified 
version of the scoring function for quarters. 


Theorem 16.1 (Dimension-Reduction). Given a tetrahedron T with a fixed scoring 
type (one of compression, vor analytic, or Voronoi), the deformation consisting of mov- 
ing a vertex v; along the edge (0, v;) towards the origin increases the score of the 
tetrahedron. 


Note that this deformation holds the solid angle at the origin fixed. See Fig. 16.1. Since 
the reduction may be performed until either a scoring system or an edge-length constraint 
is met, this argument reduces the number of free parameters for the verification, thus 
reducing the dimension and complexity of the verification of a relation. 


Proof. There are three cases to consider: compression scoring, vor analytic scoring 
and Voronoi scoring. This technique was introduced in Proposition 8.7.1 of [Ha6] for 
compression-scoring, and is proved there in the compression case. 

Next we consider vor analytic-scored tetrahedra. The validity of the same reduction for 
vor analytic-scored tetrahedra is obvious if the tip of the Voronoi cell does not protrude. 
If the tip does protrude, we must use the analytic continuation for the Voronoi volume. 
In this case the validity of the reduction is not obvious. 

The geometric constraint of moving a vertex along an edge can easily be stated 
analytically in terms of the original edge lengths, (91, y2, y3, y4, Ys, yo). This action 
depends on a single parameter, the distance of the vertex v; from the origin, which we 
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Vy 


V; 


Fig. 16.1. Tetrahedron with distinguished vertex and labeled edges. 


call t. The new edge lengths are given by 


2 2 2 2 2 2 
t(yi* + y3 =). Jet iyi ya Fe) 


lyase ft? 
Bal y1 


Recall from Section 8.6.3 of [Ha6] that the formula for the analytic Voronoi volume 
is a rational function of x, u, A, and x;, where x; = y?. Further recall that x, u, and 
A are all polynomial functions in x; that are defined in Sections 8.1 and 8.2 of [Ha6]. 

Substituting the computed edge lengths in the formula for the analytic Voronoi volume, 
taking the partial derivative with respect to t, replacing t with y,, multiplying by the 
positive term 


BV Au(x1, x3, X5)U(x1, x2, X6)/Y1, 


and then simplifying, we end up with a large homogeneous polynomial in x; of degree 
6, which is too ugly to exhibit here (having ninety-one terms). 

Evaluating this polynomial over all possible quasi-regular tetrahedrons and quarters, 
we find that it is positive. 

Therefore the volume is increasing in f, so to increase the score we should push 
the vertex in along the edge. The verification of the sign of the polynomial is found in 
Calculation 17.4.5.1. This completes the case of vor analytic-scored tetrahedra. 

The final case is Voronoi scoring. The deformation does not change the solid an- 
gle of the tetrahedron. The only term of the Voronoi score that changes is a negative 
constant times a volume. The contraction of the tetrahedron decreases this volume, and 
increases the score. The validity of a similar reduction argument for Voronoi scoring of 
a tetrahedron is now obvious. 
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16.3. Proof of Proposition 15.4 


It suffices to prove Proposition 15.4 for strict PC pentahedral prisms. Each nonstrict PC 
pentahedral prism is a pointwise limit of strict ones of the same combinatorial type, so 
the inequality in the conclusion of the proposition will hold for nonstrict PC pentahedral 
prisms by continuity. 

We use three separate computations to construct and prove Proposition 15.4. First, 
we prove a relation between dihedral angle and score. We then show that if the dihedral 
angle of a tetrahedron in a pentahedral cap exceeds a certain bound, then the associated 
pentahedral prism is not a strict PC pentahedral prism. We call such a bound a dihedral 
cutoff. This cutoff allows us to prove the final bound. 

In the following discussion, dih(7) refers to the dihedral angle associated with the 
first edge of a quasi-regular tetrahedron T, o(T) refers to the compression score of the 
tetrahedron, and sol(T) refers to the solid angle at the distinguished vertex. We restrict 
our attention to quasi-regular tetrahedrons whose score exceeds —0.52 pt, as otherwise 
the associated pentahedral prism cannot contravene. 

The first relation is 


a(T) < a, dih(T) — a, (16.1) 


where a; = 0.3860658808124052 and az = 0.4198577862. Calculation 17.4.1.1 pro- 
vides the verification of this relation. 


Lemma 16.2. /f a pentahedral prism has a pentahedral cap that contains a quasi- 
regular tetrahedron T with dihedral angle dih(T) > do, where dy = 1.4674, then it is 
not a strict PC pentahedral prism. 


Proof. Applying relation (16.1) to four quasi-regular tetrahedrons T; forming part of a 
strict PC pentahedral prism, we find 


4 4 

yo(T) <a y> dih(7;) — day. (16.2) 
i=l i=l 

Applying the relation 


4 
dih(Ts) = 27 — > dih(7;) (16.3) 


i=1 


and adding o (75) to both sides of relation (16.2), we find 
5 
a o(T;) < o(Ts) + a)\(2a — dih(Ts)) — 4a. (16.4) 
i=l 


The left-hand side represents the score of the pentahedral cap. If the right-hand side does 
not exceed 3.48 pt, then the pentahedral prism is not a strict PC pentahedral prism. 
We assert that if dih(7’) > do, the right-hand side 


o(Ts) + a; (20 — dih(Ts)) — 4a 
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does not exceed 3.48 pt. Equivalently, we prove that dih(T) > dp implies 
o(T) — a, dih(T) < 3.48 pt — 27a, + 4a, 


which is verified in Calculation 17.4.1.2. 
We conclude that if dih(7) > do then the pentahedral prism cannot be a strict PC 
pentahedral prism. O 


Hence we may restrict our attention to quasi-regular tetrahedrons whose dihedral 
angle does not exceed the dihedral cutoff do. 
Using the dihedral cutoff, we establish the final relation, 


o(T)+msol(T) + a(dih(T) — 27/5) —b. < 0 


via Calculation 17.4.1.3. This completes the proof of Proposition 15.4. 


16.4. Proof of Proposition 15.5: Top Level 


It suffices to prove Proposition 15.5 for strict PC pentahedral prisms, by a similar argument 
to that used for Proposition 15.4. 

Recall from Definition 7.15 that a quad cluster is a standard region that is a quadri- 
lateral. Quad clusters can be classified as follows: 


1. Flat quad clusters. 

2. Octahedra. 

3. Pure Voronoi quad clusters. 
4. Mixed quad clusters. 


We subdivide the third class into acute and obtuse types. See Section 10.4 for a 
discussion on the classification of quad clusters. By Lemma 10.14, the score of a mixed 
quad cluster is less than —1.04 pt. A PC pentahedral prism therefore cannot contain a 
mixed quad cluster, so the bound of Proposition 15.5 holds trivially for this class. 

We treat the remaining classes in the following sections. 


16.5. Proof of Proposition 15.5: Flat Quad Clusters 
Recall that a flat quarter is a quarter whose long edge is opposite its distinguished vertex. 
Lemma 16.3. Given a flat quarter Q with o(Q) > —1.04 pt, the relation 

o(Q) < —msol(Q) + b/2 (16.5) 
holds. 
Proof. Label the diagonal of a flat quarter ye. 


Flat quarters may be scored using either compression or vor scoring. We treat each 
case separately. 
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First, suppose that we wish to prove the bound for compression scored quarters. This 
means that the circumradii of the two faces adjacent to the long diagonal do not exceed 
/2. We subdivide the verification into Calculation 17.4.2.1, a computation where we 
apply dimension-reduction and partial derivative information, and Calculation 17.4.2.2, 
a boundary verification, where we restrict our attention to cells which lie on the boundary 
between compression and vor scoring. 

Second, we treat the vor-scoring case. In this case we prove the bound for vor-scored 
quarters. This means that at least one of the circumradii of the two faces adjacent to 
the long diagonal is at least /2. This verification is somewhat more complex than the 
compression case. We subdivide the verification into 


1. Verification that the first three partials are negative on a small cell containing the 
corner (Calculation 17.4.2.3). 

2. Verification of the bound on that small cell containing the corner, using the property 
that the first three partials are negative (Calculation 17.4.2.4). 

3. A computation where we apply dimension-reduction and partial derivative reduc- 
tion, omitting the corner cell (Calculation 17.4.2.5). 

4. A boundary verification, where we restrict our attention to cells which lie on the 
boundary between compression and vor scoring, again omitting the corner cell 
(Calculation 17.4.2.6). 


These calculations complete the proof of Lemma 16.3. 


We are now prepared to prove Proposition 15.5 for flat quad clusters. 

Flat quad clusters are composed of two flat quarters, whose common face includes 
the long edge. 

By Proposition 15.2, we restrict our attention to flat quarters whose score exceeds 
—1.04 pt, recalling the fact that the score of flat quarters is nonpositive. 

Invoking Lemma 16.3 for each flat quarter and adding the relations, we arrive at the 
desired bound for flat quad clusters. This completes the proof. 


16.6. Proof of Proposition 15.5: Octahedra 


Recall that quartered octahedra, a type of quad cluster, are composed of four upright 
quarters arrayed around their common long edge (called the diagonal) so that each face 
containing the common edge is shared by two quarters. 

We are required to prove a relation of the form 


o(H)+msol(A) —b <0, 


where o(H) denotes the score of an octahedron H, sol(H) denotes the solid angle 
associated with the distinguished vertex, and m and b are positive constants. By Propo- 
sition 15.2, we restrict our attention to octahedra whose score exceeds — 1.04 pt. 

Our treatment of octahedra, as usual, is comprised of a number of auxiliary compu- 
tations. We prove bounds on upright quarters which are part of an octahedron, and then 
combine these bounds to deduce the required bound on octahedra in general. 
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The scoring function o(-) for upright quarters is either compression (denoted by 
gma(-)) or an average of two vor(-) scores, which we continue to refer to as vor-scoring. 
See Remark 7.23 for a simplified version of the scoring rules. 

Due to the complex nature of octahedra, we consider a number of subcases. These 
cases are partitioned according to the length of the diagonal and the scoring system 
applied to the upright quarters. 

Using a dihedral summation argument, we eliminate octahedra whose diagonal lies 
in the range [2.51, 2.716]. 

Next, we treat the case where the diagonal lies in the range [2.716, 2\/2]. Using 
a dihedral correction term, we prove the bound for octahedra which are completely 
compression-scored, and octahedra which are completely vor-scored. 

The remaining cases consist of octahedra which contain either two or three vor-scored 
quarters. (Since a quarter is vor-scored if one of the faces containing the diagonal has 
circumradius 2 or greater, it is not possible for an octahedron to contain only one 
vor-scored quarter.) We treat these cases using an additional correction term. 

In all computations involving octahedra, we label the diagonal y,. 

In order to simplify the computations, we first prove an auxiliary cutoff bound. This 
first bound reduces the size of the cell over which we must conduct our search, as per 
Proposition 15.2. 


Lemma 16.4. Jf an upright quarter contains an edge numbered 2, 3, 5, or 6 whose 
length is not less than 2.2, then its score is less than or equal to —0.52 pt. 


Proof. This is Calculation 17.4.3.1. O 


Since such an edge is shared by another upright quarter in the same octahedron, the 
score of the associated octahedron must fall below — 1.04 pt. 
We restrict our search accordingly. 


Lemma 16.5. The score of an octahedron H with upright diagonal in the range 
[2.51, 2.716] is less than or equal to —1.04 pt. 
Proof. (n Calculation 17.4.3.2 we prove a bound of the form 

o(S)+cdih(S) <d 


on upright quarters S$, where c = 0.1533667634670977 and d = 0.2265. Adding the 
bound for four quarters S; forming an octahedron, we find 


4 


4 
y- o(Si) + ey. dih(S;) < 4d. 


i=l i=l 
Using the fact that the sum of the dihedral angles is 277, we find that 
o(A) < —2mc + 4d 


for such an octahedron H. 
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A computation involving the constants c and d shows that the score is less than 
—1.04 pt. 


Again invoking Proposition 15.2, we need only consider octahedra whose diagonal 
lies in the range [2.716, 2/2]. 
Using this assumption, we prove bounds of the form 


b 
o(S) + msol(S) + a dih(S) < 7+ as (16.6) 
and 
b 
o(S) +m sol(S) +a dih(S) + Bx < 3+ as + 8B, (16.7) 


where dih(S) refers to the dihedral angle associated with the diagonal, o (S) refers to the 
scoring scheme appropriate for a particular upright quarter S, and x; refers to the square 
of the length of the diagonal. We choose a and £ according to the scoring scheme. 

Appropriate values for the correction terms involving a and f were determined by 
experimentation. 

Choosing a = 0.14, we prove (16.6) for compression-scored quarters with diagonal 
in the interval [2.716, 2/2] (Calculation 17.4.3.3). Using the same a, we prove (16.6) 
for vor-scored quarters with diagonal in the range [2.716, 2.81] (Calculation 17.4.3.4). 

Choosing a = 0.054, B = 0.00455, we prove (16.7) for compression-scored quar- 
ters with diagonal in [2.81, 2/2] (Calculation 17.4.3.5). Choosing the same a, but 
B = —0.00455, we prove (16.7) for vor-scored quarters with diagonal in [2.81, 2/2] 
(Calculation 17.4.3.6). 

Note that for vor-scored quarters, the first inequality is a relaxation of the second, 
since 6 is negative. 

The verification of each of these inequalities involves a computation where we apply 
dimension-reduction and partial derivative information, and a boundary verification, 
where we restrict our attention to cells which lie on the boundary between compression 
and vor analytic scoring. Note that the dimension-reduction step for relation (16.7) is 
complicated by the presence of the 6x, term. 


Lemma 16.6. Proposition 15.5 holds for octahedra with upright diagonals in the range 
(2.716, 2/2]. 
Proof. Summing inequality (16.6) over the four quarters S; of an octahedron, we find 
4 4 4 
Y *o(S;) +m)" sol(S;) +. )° dih(S;) < b+ 2az. 
i=l i=1 i=l 
Using the fact that the dihedral angles sum to 277, we find 
o(H)+msol(H) < b, 


so octahedra H with diagonals in the range [2.716, 2.81] satisfy the requisite bound. 
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Summing inequality (16.6) over a consistently scored octahedron (either all compres- 
sion or all vor) with diagonal in the range [2.81, 2/2], we again arrive at the desired 
bound. 

The remaining cases involve octahedra which contain both compression and vor- 
scored quarters, and whose diagonals lie in the range [2.81, 2\/2]. For this case we use 
inequality (16.7). 

The summation involving inequality (16.7) is identical to inequality (16.6) save for the 
presence of the 6 terms. If there are two vor-scored quarters and two compression-scored 
quarters, the beta terms cancel, giving the relation as before. 

If there are three vor-scored quarters and one compression-scored quarter, we note 
that the same relation for vor-scored quarters holds if we replace 8 by 6/3 (since we 
have now relaxed the bound). Summing the inequalities, the term involving 6 vanishes 
again, leaving the desired inequality. 


Lemmas 16.5 and 16.6 prove Proposition 15.5 for octahedra. 


16.7. Proof of Proposition 15.5: Pure Voronoi Quad Clusters 


The next class of quad clusters which we treat are the pure Voronoi quad clusters. We 
will define a truncation operation on these quad clusters. Truncation will simplify the 
geometry of the quad clusters, and will provide a convenient scoring bound. We divide 
our treatment of pure Voronoi quad clusters into two cases in order to simplify the 
analysis and numerical verifications as much as possible. 

Recall from the classification of quad clusters that a pure Voronoi quad cluster consists 
of the intersection of a V-cell at the origin with the cone at the origin over a quadrilateral 
standard region. We refer to the restriction of the V -cell to the cone over the quadrilateral 
as either the V-cell or the Voronoi cell of the quad cluster. Figure 16.2 describes the 
geometry of a simple V-cell. 


Fig. 16.2. A pure Voronoi quad cluster. 
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In addition, recall that a vertex lying in the cone over a pure Voronoi quad cluster must 
have height greater than 2\/2. Such vertices can significantly complicate the geometry 
of the V-cell, affecting its shape and volume. 

We remove the effect of vertices lying above a pure Voronoi quad cluster by removing 
all points from the V-cell which have height greater than 2. We call this operation 
truncation at \/2. Truncation decreases the volume of the quad cluster. This decrease in 
volume increases the score of the quad cluster, bringing it closer to the proposed bound. 

We refer to truncated pure Voronoi quad clusters as truncated quad clusters. 

We define a scoring operation on pure Voronoi quad clusters which we call truncated 
Voronoi scoring. This operation consists of truncation at /2, followed by the usual 
Voronoi scoring. 

Each diagonal across the face of a cluster must have length greater than 2/2, otherwise 
we could form two flat quarters, contradicting the decomposition. We choose the shorter 
of the two possible diagonals, and consider that diagonal in the analysis which follows. 

We decompose the cluster into two tetrahedrons along the chosen diagonal. The face 
dividing the tetrahedrons is either acute or it is obtuse. We treat each case separately. 

We must prove 


o(Q)+msol(Q) —b < 0, 
where o(Q) denotes the score of a pure Voronoi quad cluster Q, sol(Q) denotes the 
solid angle associated with the distinguished vertex, and m and b are positive constants. 


We call this relation a bound on the solid angle and score of a quad cluster. Invoking 
Proposition 15.2, we restrict our attention to quad clusters whose score exceeds — 1.04 pt. 


16.8. Pure Voronoi Quad Clusters: Acute Case 


Lemma 16.7. [fan acute quad cluster is divided along an acute separating face, then 
the score of each half is nonpositive. 


Proof. This is a consequence of the arguments of Theorem 8.4. O 


We therefore restrict our attention to halves whose score exceeds — 1.04 pt, by Propo- 
sition 15.2. 
If the separating face is acute, we prove 


o(S;) +msol(S;) — b/2 < 0 (16.8) 
for each half S; independently, and deduce the desired bound by adding the bounds for 
each half. 


Lemma 16.8. Let To denote the tetrahedron with edge lengths (2,2, 2,2, 2, 2/2). 
Let sol(Io) denote the solid angle of the tetrahedron Tp. Given a tetrahedron T, if 
sol(T) < sol(To), then relation (16.8) holds. 


Proof. Tf sol(T) < sol(7o), then msol(T) < m sol(To), hence 
msol(T) — b/2 < msol(To) — b/2 < 0 
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and 
o(T)+msol(T) — b/2 < o(T) <0, 


by Lemma 16.7. 


We therefore may restrict our attention to halves whose solid angle is at least sol(7p). 
In addition, we restrict our attention to halves for which the dividing face is acute. 


Lemma 16.9. The relation 
o(T)+msol(T) — b/2 < 0 


holds for a tetrahedron T forming half of an acute quad cluster with score exceeding 
—1.04 pt. 


Proof. The required verifications for each half of an acute quad cluster are somewhat 
difficult to achieve directly, so we subdivide into a number of different cases in an attempt 
to reduce the complexity of the calculations. First, we show that the bound holds for all 
halves whose diagonal is at least 2.84 (Calculation 17.4.4.1). Using this information, 
we then prove the bound everywhere but in a small corner cell (Calculation 17.4.4.2). 
We then restrict our attention to the small corner cell (Calculation 17.4.4.3). These 
computations involve the use of partial derivative information, and include the required 
boundary computations. 


Invoking Lemma 16.9 for each half and adding them proves Proposition 15.5 for the 
acute case. 


16.9. Pure Voronoi Quad Clusters: Obtuse Case 


If the separating face is obtuse, the analysis becomes significantly harder. It is no longer 
possible to prove the desired bound on each half independently. The dimension of the 
full bound, even using the usual dimension-reduction techniques, is too high to make the 
verification tractable numerically. Therefore we adopt a different approach. 

Using the dimension-reduction technique, we push each vertex along its edge until the 
distance from each vertex to the origin is 2. We call the resulting quad cluster a squashed 
cluster. Observe that the solid angle of the cluster is unchanged, while the volume of the 
Voronoi cell has decreased, thereby increasing the score of the cluster. 

Since the central face is still obtuse, the length of the diagonal after this perturbation 
must still exceed 2/2. Note, however, that the other edge lengths in the quad cluster can 
be as small as 4/2.51. 

The geometry of the V-cell of a squashed cluster, assuming that there is no trun- 
cation from vertices of the packing lying above the quad cluster, is that of Fig. 16.2. 
When the V-cell is truncated at ./2 from the origin, two potential arrangements arise. 
In the first arrangement, the truncated region is connected, as in Fig. 16.3. In sec- 
ond potential arrangement, the truncated region is formed of two disjoint pieces, as in 
Fig. 16.4. 
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Fig. 16.3. A typical truncated quad cluster. 


Lemma 16.10. The disjoint case cannot arise for squashed quad clusters. 


Proof. Suppose that it could. Pick an untruncated point along the central ridge of the 
V-cell (see Fig. 16.4). The distance of this point from the origin is then less than /2, 
but due to its location on the central ridge, it is equidistant from the two nearest vertices 
and the origin. This implies that the circumradius of the resulting triangle must be less 
than \/2, which contradicts the fact that the diagonals have length at least 2/2. 


16.9.1. A Geometric Argument. We introduce a simplification which reduces the com- 
plexity of the obtuse case. This simplification consists of a perturbation of the upper edge 
lengths of a squashed quad cluster. This perturbation increases the score while holding 
the solid angle of the quad cluster fixed. 

This simplification is based on a geometric decomposition of the truncated Voronoi 
cell. We describe the decomposition, and then describe a construction which will ulti- 
mately simplify the analysis. 

While our arguments will extend to treat a general squashed and truncated Voronoi 
cell associated with a general standard cluster, we restrict our attention to truncated 
Voronoi cells associated with quad clusters. 


Fig. 16.4. An impossible arrangement. 
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Fig. 16.5. A representation of a truncated quad cluster. 


To begin, we consider the decomposition of a truncated Voronoi cell into its fundamen- 
tal components. A truncated Voronoi cell is formed of three elements: a central spherical 
section (formed by the truncation), wedges of a right circular cone, and tetrahedrons 
called Rogers simplices. 

We choose a representation of a truncated quad cluster composed of the radial pro- 
jection of each element to a plane passing close to the four corners of the quad cluster. 
This decomposition is represented in Fig. 16.5. 


16.9.2. Rogers Simplices. We now consider the geometry of the Rogers simplices. 
Consider a face with edge lengths (2, 2, t) associated with a side of a truncated quad 


cluster. Let b represent the circumradius of the face, and let r represent the orthogonal 
extension of a Rogers simplex from the face, as in Fig. 16.6. 


| 
Fig. 16.6. Detail of truncated Voronoi decomposition. 
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Fig. 16.7. Detail of Rogers simplex. 


Then 
7 4 
16 — #2 
pe = 16 — 272 
_— i6=e 
and 
t 
s=Vb?-—1= —_. 
Vv 16 — t2 
See Fig. 16.7. 


16.9.3. The Geometric Construction. We now present the geometric construction which 
implies the simplification. 

We represent the geometry of the truncated Voronoi cell associated with one-half of 
a quad cluster in Fig. 16.8. 

We can simplify the representation by extending the wedges to enclose the Rogers 
simplices. See Fig. 16.9. This process adds an extra volume term. 

The overlap between the wedges is slightly complicated. We simplify the overlap as 
follows. Take the cone over the overlap. Intersect it with a ball of radius ./2 at the origin. 
We call the spherical sections produced by this construction flutes. This construction is 
represented in Fig. 16.10. Figure 16.11 is a planar representation of this construction. 

To form each flute, we have added two extra pieces of volume (per flute) to our 
construction. We call these pieces quoins. We attach each quoin to a Rogers simplex. 
See Fig. 16.12. 


16.9.4. A Solid Angle Invariant. We now require some notation for the volumes which 
enter into this construction. Let c denote the volume of the central spherical angle. Let r 
denote the volume of the Rogers simplices. Let w denote the volume of the wedges. Let 
w’ denote the volume of the extended wedges. Let q denote the volume of the quoins. 
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Fig. 16.8. Decomposition of a truncated Voronoi cell. 


Let f denote the volume of the flutes. Finally, let v denote the volume of the truncated 
Voronoi cell. 


Lemma 16.11. /f we hold the solid angle fixed, the volume of a truncated squashed 
Voronoi cell depends only on q, the volume of the quoins. 


Proof. By the original decomposition, 


v=c+r+uw. 
By our construction, 


vectu'4+q-f. 


Fig. 16.9. Wedges extended to include the Rogers simplices. 
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Fig. 16.10. Decomposition with flutes. 


Fig. 16.11. Planar representation with flutes. 


Fig. 16.12. Detail of Rogers simplex with quoin. 
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Recall that the solid angle s of the quad cluster is the sum of the dihedral angles minus 
2x. The dihedral angles to which we refer are those associated with the edges between 
each corner of the quad cluster and the origin. 

Our perturbation will hold the solid angle s of the quad cluster fixed. Therefore, the 
sum of the dihedral angles must also be fixed. This fixes w’. 

Take the cone over each extended wedge and intersect it with a ball of radius /2 
centered at the origin. Let ¢ denote the sum of these volumes. Since the sum of the 
dihedral angles is fixed, ¢ is also fixed. 

Further, note that 


2/2 
s=c+tr—f. 
3 
This relation implies that c — f is fixed. Combining this with the previous relations, 
we find that if we hold the solid angle fixed, the volume of the truncated Voronoi cell 
depends only on q, the volume of the quoins. oO 


16.9.5. Variation of the Volume ofa Quoin. Consider a face (2, 2, t) of a truncated quad 
cluster. Two Rogers simplices are associated with this face, as suggested in Fig. 16.6. 
Observe that the volume of the quoin associated with one of these Rogers simplices is 
increasing inr = V(16 — 2t?)/(16 — t?). Next, observe that r is in turn decreasing in ft. 
Therefore increasing t decreases the volume of the squashed quad cluster, if we hold the 
solid angle fixed (by varying the length of another edge of the squashed quad cluster). 
Each half of a squashed quad cluster has two variable edge lengths (not counting the 
shared diagonal). We label the variable edge lengths of one-half of the squashed quad 
cluster y; and y2. We label the length of the diagonal d. Holding the solid angle fixed, we 
may perturb one half by shrinking the larger and increasing the shorter length. We wish 
to establish the following lemma, that increasing the short length reduces the volume of 
the truncated Voronoi cell more than decreasing the longer length increases the volume. 


Lemma 16.12. Holding the solid angle fixed for one-half of a squashed quad clus- 
ter, shrinking the longer upper edge (while increasing the shorter edge appropriately) 
reduces the volume of the squashed quad cluster (increasing the score). 


To prove Lemma 16.12, we establish a variational formula for the volume of a quoin. 

We then verify that the volume of the quoin associated with the shorter edge is 
decreasing faster under this perturbation than the volume of the quoin associated with 
the longer edge is increasing. 

In other words, we wish to show that yj; < y2 implies that V(y,) + V(y2()1)) is 
decreasing in y;, or equivalently, 


d 
Vii) + Yoru <0, 
Dal 


where V(t) is the volume of the quoin, V,(t) is the derivative of the volume, and y, is 
an implicit function of y,. 

We construct the volume of a quoin by integrating the area of a slice. We place the 
quoin in a convenient coordinate system. See Figs. 16.13 and 16.14. The truncating 
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Fig. 16.13. Top view of quoin. 


sphere has equation x” + y* + z? = 2. At the base of the quoin, z = 1, sox = /1 — y? 
gives the location of the right-boundary of the quoin. The plane forming the left face of 
the quoin is given by the equation x = sz, so the ridge of the quoin is given by the curve 
(su, y, u), where u = ,/(2 — y2)/(1 +s”). 

Hence the area of a slice parallel to the x-z plane is given by the formula 


A(t, 9) = youu + f Ga- Po = Nae 


SU 


The volume of a quoin is therefore given by the formula 


Vit) = [ A(t, y) dy. 


(su, y, u) 


Fig. 16.14. Side view of quoin. 
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We actually only need to compute V;(t), which is fortunate, since the explicit formula 
for V(t) is somewhat complicated. We have 


Vi) = [ A:(t, y)dy + AC, rrr, 
but A(t, 7) = 0, so 
van = [Ad yay. 
So in addition, we only need A;(t, y), 


S45 1—y? 
A(t, y) = (5@ i= r=¥+ [ 2= 3 =¥9 de), 
‘ 2 t/4/2—y2 


SO 


ss 1 
A(t. y) = (5+) 2 2-y)— yoy2= 9%, 


which simplifies to 


hie 8 2-y? 
mee 6 — PPP We 
Hence 
8r r r? 
V(t) = 


+ J 
(16 — 17) 4/16 — 22 6/16 — 2? 
which simplifies to 
—2,./2(8 — t7)3/2 
3(16— 17)? * 


We are now prepared to prove Lemma 16.12. 


Vi) = 


Proof. Holding the solid angle fixed, y is an implicit function of y,. We wish to prove 
that y; < y2 implies 


dy 
Vi(n1) + Vin) — <0. (16.9) 
dy 
We derive a formula for dy2/dy,, using the solid angle constraint 
sol(2, 2,2, 1, y2,d) =c, (16.10) 


where c is a constant. Using formulas from [Ha6], (16.10) becomes 


2 arctan | —— ] =c. 
2a 
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Let x; = v7; x2 = Yes and b = d2. Then 


A= 4h? = ACY = 25)" + br (8 = 25) + 8x0) 


and 
a= 32—d—x,— Xp. 
So 
—4b? — A(x, — x2)? + b(x1 (8 — x2) + 8x2) 
G2—d—x,—m) et 
Therefore 
dx, (16 — x2)(%2 + b — x1) 
dx,  (16—x,)(4, +b—x)) 
and 
dy, _ y dx2 
dy, yx dx’ 
hence 


dyx yi (16 — x2)(x2 + b — x1) 
dy y2(16 — x1) (x1 + b = x2) 
We substitute the formula for dy2/dy, into (16.9). Letting x; = y?, and noting that 


all the denominators are positive, we obtain on clearing denominators that the desired 
relation (16.9) is equivalent to 


(16.11) 


—(8 = x1)? (16 = x2) 201 +.B = x2) + (8 = 2)°7(16 — xy) 1 (Qa +b — x1) <0, 
or 

(16 — x1)7x1 (8 — x2)°(b = x1 +2) < (16 — x2)? x2(8 — 11) (b + x1 — 3)”. 
If we define 


@(x1, x2) = (16 — x1)?x1(8 — x2)°(b — x1 +42)’, 


then the desired inequality is equivalent to g(x;, x2) < g(%2, x;) for x; < x2. There are 
several ways to prove this monotonicity relation. One is to prove that the polynomial 


g(x, X2) ~ g(x, x1) 
8(x] — x2) 


is positive for all allowable values for x;, x2, and b. Unfortunately, the resulting polyno- 
mial has degree 6, so the verification is somewhat unwieldy, although easy enough using 
interval methods. 

A simpler method involves a factorization of g into g; and gy. We show that g, and 
82 each satisfy the monotonicity relation, and the relation then follows for g. 

Define 


81(%1, X2) = (16 — x1)x1 (8 — x2)(b — x1 + x2) 
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and 


82(H1, X2) = (16 — 21)(8 = 22)°(b — 1 + 2). 
Clearly g = g)g2. We then construct the polynomials 


_ 8141, X2) — 812, x1) 
X1 — X2 
and 
_ 82(%1, X2) — 82(%2, x1) 
: x1 — X2 , 


Simplifying p; and p>, we find that 


p, = 128b — 128x, — 8bx, + 8x? — 128%) 


2 2 2 
+ 32x1x2 + bx1 x2 — Xj xX2 + BxI — XL XZ 


and 


p2 = —2048 + 192b + 320x; — 16bx; — 16x? + 320x2 
—16bx2 — 32x, x2 + bx\x2 + Poeo _ 16x5 + x 1x5. 
These polynomials are quadratic in x; and x, and linear in b. The coefficient of b in p; 
is 
128 — 8x, — 8x2 + x1 xX. 


The coefficient of b in pp is 
192 — 16x, — 16x. + xyx. 


Both coefficients are positive for x, and x2 in [16/2.5 1°, 2.517]. Therefore, the minimum 
values of p; and p2 occur when b is at a minimum, b = 8. 

The minimum value of each polynomial for values of x; and x2 in the range 
[16/2.51°, 2.51] is now easily computed. Making the appropriate computations, we 
find that each polynomial is indeed positive. Hence the desired relation follows. 


16.9.6. Final Simplification. 
Lemma 16.13. Obtuse quad clusters satisfy the bound of Proposition 15.5. 


Proof. We begin with a squashed quad cluster with consecutive upper edge lengths 
(V1, Y2, y3, y4) and diagonal d adjacent to the first two upper edges. 

Recall that we chose the diagonal of the quad cluster to be the shorter of the two 
possible diagonals. We refer to the other possible diagonal as the cross-diagonal. Recall 
that the reduction fixes the length of the diagonal. 

If the length of the cross-diagonal does not drop to 2\/2 under the perturbation of 
Lemma 16.12, we arrive at the configuration with edge lengths (yj, y;, ¥5, 5) with 
diagonal d. 
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If the length of the cross-diagonal does drop to 2,\/2, then stop the perturbation. This 
gives a quadrilateral (y{, y5, v3, ¥,) with diagonal 2/2. Applying the perturbation to 
each half independently, we find that the score of each half is maximized by the config- 
uration (yj’, yi, ¥5, y3) with diagonal 2,/2. We verify the relation for this arrangement 
in Calculation 17.4.4.5. 

If the length of the cross-diagonal did not drop to 2\/2, switch to the cross-diagonal 
and repeat the process. If the (new) cross-diagonal does not drop to 2/2, we have arrived 
at the configuration (y, y, y, y) with diagonal d’. Choose a new diagonal d” to be the 
shorter of the two possible diagonals. We verify the desired relation for this arrangement 
in Calculation 17.4.4.4. 

Finally, we make a few comments about extra constraints in the verifications. 

Since the score of a quad cluster is nonpositive, and m(2 sol(7o)) — b < 0 where 
sol(7o) = sol(2, 2, 2, 2, 2, 2/2), we need only consider quad clusters for which the 
solid angle exceeds 2 sol(7p). 

The maximum length of the diagonal is 2.51./2, since otherwise the triangles in the 
quadrilateral would be obtuse, forcing the cross-diagonal to be shorter than the diagonal. 
This would contradict our original choice of the shortest diagonal. 

In Calculation 17.4.4.4 we assume that d is the shortest diagonal. Adding this con- 
straint directly is tedious, since the formula for the cross-diagonal of the quad cluster is 
somewhat complicated. We apply a simpler but weaker constraint, that the diagonal d of 
a planar quadrilateral with edge lengths (y, y, y, y) is shorter than d’, the other planar 
diagonal. The constraint d < d' gives the constraint d* < 2y. Since the cross-diagonal 
of the quad cluster is shorter than the cross-diagonal of the planar quadrilateral, this 
constraint is weaker. O 


Lemmas 16.9 and 16.13 prove Proposition 15.5 for pure Voronoi quad clusters. 


17. Calculations 


The verifications of the relations required in this paper appear intractable using traditional 
methods. Therefore, we use a relatively new proof technique, interval arithmetic via 
floating-point computer calculations. 


17.1. Interval Arithmetic 


We review the basic notions of interval arithmetic. 

Suppose that the value of a function f(x) lies in the interval [a, b]. Further, suppose 
that g(x) lies in the interval [c, d]. Then f(x) + g(x) must lie in [a +c, b+ d]. While it 
may be the case that we could produce better bounds than this for the function f+ g, these 
interval bounds give crude control over the behavior of the function. Interval arithmetic 
provides a mechanism for formalizing arithmetic on these bounds. 

We represent an interval ¢ as [t ; i | Then for intervals a and b, 


at+b=[a+b,a+5)]. 
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Likewise, 
a—b=[a—b,a—5]. 
Multiplication is somewhat more complicated. Define 
C = {ab, ab, ab, ab}. 
Then 
a * b = [min(C), max(C)]. 


Division is similar, as long as the dividing interval does not contain zero. 
Similarly, we can define the operation of a monotonic function on an interval. For 
example, 


arctan(a) = [arctan(a), arctan(a@)]. 


Using interval arithmetic, we can produce rigorous bounds for polynomials evaluated 
on intervals. Likewise, we can produce rigorous bounds for rational functions evaluated 
on intervals. Finally, we add the composition of monotonic functions. This allows us 
to produce interval bounds for functions such as sol(-) and vor(-) over quasi-regular 
tetrahedrons, quarters, or quad clusters. 


17.2. The Method of Subdivision 


The relations on tetrahedra and quad clusters required for our approach typically have 
the form g(y) < 0 for y € I, where J is a product of closed intervals. As g is usually 
continuous, the existence of a maximum is trivial. However, bounds on the behavior of 
g over all of J computed directly via interval arithmetic are generally poor. 

We define a cell to be a product of closed intervals. By subdividing / into sufficiently 
small cells, the quality of the computed bounds on each cell usually improves enough to 
prove the relation for each cell, and hence for the original domain J. 

If in fact g(y) < c < 0, this approach works very well. However, if the bound is tight 
at a point yo, 1.e., g(yo) = 0, then pure subdivision will usually fail, since the computed 
upper bound on g over any cell containing yo will typically be positive. 

If yo is not an interior maximum, we turn to the partial derivatives of g. If we can 
show that the partials of g on a small cell containing yo have fixed sign (bounded away 
from zero), then the maximum value of g on that cell is easily computed. It is typically 
the case that a cell must be very small before we can determine the sign of the partials 
via interval-arithmetic bounds. 


17.3. Numerical Considerations 


Most real numbers are not representable in computer floating-point format. However, 
floating-point intervals may be found which contain any real number. Although the 
magnitude of real numbers representable in fixed-length floating-point format is finite, 
the format also provides for +-oo, which allows for interval containment of all reals. 
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These intervals may be added, multiplied, etc., and the resulting intervals will contain 
the result of the operation applied to the real numbers which they represent. 

Since floating-point arithmetic is not exact, interval arithmetic conducted using 
floating-point arithmetic is not optimal, in the sense that the interval resulting from 
an operation will usually be larger than the true resultant interval, due to roundoff. How- 
ever, barring hardware or software errors (implementation errors, not roundoff errors), 
floating-point interval arithmetic, unlike floating-point arithmetic, is correct, in the sense 
that it provides correct interval bounds on the value of a computation, while floating-point 
arithmetic alone only provides an approximation to the correct value of a computation. 
We may therefore use interval arithmetic to prove mathematical results. Floating-point 
arithmetic alone, in the absence of rigorous error analysis, cannot constitute a proof. 

We implement floating-point interval arithmetic routines via the IEEE 754 Standard 
for floating-point arithmetic [IEEE]. 

Implementation of interval arithmetic is straightforward using directed rounding. 
In addition to arithmetic functions, we require interval implementations of the square 
root and arctangent functions. Fortunately, the IEEE standard provides the square root 
function. However, the arctangent function is somewhat problematic, since the standard 
mathematical libraries do not provide explicit error bounds for their implementations of 
the arctangent function. In theory, they should provide an accuracy for the arctangent 
routine of 0.7 ulps, meaning that the error is less than one unit in the last place. I add 
interval padding of the form [v — ¢, v+ ¢], where v is the computed value, and ¢ = 2-*”. 
This should be sufficient to guarantee proper interval containment, assuming that the 
library routines are correctly implemented. 

Armed with standard interval arithmetic and interval arithmetic implementations of 
sqrt and arctan, we can implement interval arithmetic versions of all the special functions 
required for proving the sphere packing relations. 

Evaluating these functions on cells, we get bounds. Unfortunately, these bounds are 
not very good. The bounds which we get from interval versions of the partial derivative 
functions are even worse. This means that cells have to be very small before we can draw 
conclusions about the signs of the partials. These bad bounds are due to the inherent 
nature of interval arithmetic—it produces worst-case results by design. 

These bad bounds increase the complexity of the verifications tremendously. Some 
verifications, using these bounds, require the consideration of billions or trillions of cells, 
or worse. Therefore, we needed a method for producing better bounds than those which 
direct interval methods could provide. 

The method which we eventually discovered is to use Taylor series. We compute 
explicit second (mixed) partial bounds for the major special functions, and use these 
bounds to produce very good interval bounds. These bounds are computed in Calculations 
17.4.6.1-17.4.6.8. Essentially, the Taylor method postpones the error bound until the 
end of the computation, eliminating the error bound explosion which occurs with a 
straightforward interval method implementation. 


17.4. Calculations 


The following inequalities have been proved by computer using interval methods. Let 
S = S(y) = SQ1,.--, ¥6) denote a tetrahedron parametrized by the edge lengths 
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(y1,---, 96). In addition, we often parametrize by the squares of the edge lengths 
(X1,.-., %6). 

Recall from Section 15.1 that m = 0.3621, b = 0.49246, a = 0.0739626, and 
be = 0.253095. 


Recall that for our purposes, the scoring function o (-) is given by one of four functions: 
gma(-), vor(-), octavor(-), or truncated Voronoi. See Remark 7.23 for a simplified version 
of the scoring function. 

The scoring rules depend on 7(-), the circumradius of a face, introduced in Defini- 
tion 4.20. 


17.4.1. Quasi-Regular Tetrahedra. Define C = [2, 2.51]°, and recall 


a, = 0.3860658808 124052, dz = 0.4198577862, dy = 1.4674. 


Calculation 17.4.1.1. Either 
gma(S) < a; dih(S) — ag 
or 
gma(S) < —0.52 pt 


for y € C, using dimension-reduction. 


Calculation 17.4.1.2. Either 
gma(S) — a; dih(S) < 3.48 pt — 27a, + 4a 
or 
dih(S'}) < do 
or 
gma(S) < —0.52 pt 
for y € C, using dimension-reduction. 
Calculation 17.4.1.3. Either 
gma(S) +m sol(S) + a(dih(S) — 27/5) — b. < 0 
or 
dih(S) > do 
or 
gma(S) < —0.52 pt 


for y € C, using dimension-reduction. 
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17.4.2. Flat Quad Clusters. Define I = [2, 2.51) [2.51, 2/2], and define the corner 
cell 


C=, 24 0.51/16P2V2 — 2/2 — 2.51)/16, 2/2]. 
Calculation 17.4.2.1. Either 


gma(S)+msol(S) < b/2 


or 

n(Vis Yas Yo)” > 2 
or 

n(y4, Ys, Yo)” > 2 
or 


gma(S) < —1.04 pt 
for y € I, using dimension reduction. 
Calculation 17.4.2.2. Either 


gma(S) +m sol(S) < b/2 


or 

ni, y2,¥6) =2 with (ya, ys, Yo)” <2 
or 

nv, Ys, ¥e)? =2 with (1, 2, 6)” <2 
or 


gma(S) < —1.04 pt 


for y € I, not using dimension-reduction. 
Calculation 17.4.2.3. dvor(S)/dy; < Ofori =1,2,3 andy €C. 


Calculation 17.4.2.4. This computation is somewhat tricky, since the scoring con- 
straint depends on both faces. The partial derivative information gives y3 = 2. The rest 
of the analysis depends on which face is assumed to be large. 

If the (y1, y2, yo) face is large, the partial derivative information implies that the face 
constraint is tight, so n(y1, y2, ye)? = 2. Therefore solve for y, in terms of yz and Yo. 
Apply partial derivative information for y4 and ys. In this case, 


vor(S) +m sol(S) < b/2 


for y3=2,y EC. 
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If the (y4, Ys, yo) face is large, assume that yy = y2 = 2. Then either 
vor(S) + msol($) < b/2 
or 
n(y4, Ys, Yo)” <2 
for yi = y2 = y3 =2,y EC, 
Calculation 17.4.2.5. Either 
vor(S) + msol($) < b/2 
or 


n(y1, Y2, 6)? <2 and nya, ys, Yo)” <2 


or 
vor(S) < —1.04 pt 


for y € I, y € C, using dimension-reduction and partial derivative information. 


Calculation 17.4.2.6. Either 
vor(S) + msol(S) < b/2, 
with 


n(y1, Ya, We)? =2 or nya, ys, Ye)? =2, 


or 
vor(S) < —1.04 pt 


for y € I, y ¢ C, not using dimension-reduction. 
17.4.3. Octahedra 


Calculation 17.4.3.1.  o(S) < —0.52 pt, for each (appropriately scored) upright quar- 
ter with edge lengths in the cell [2.51, 2/2][2.2, 2.51][2, 2.51]. 
Calculation 17.4.3.2. Recall c = 0.1533667634670977 and d = 0.2265. Either 
gma(S) +c dih(S) <d 
or 
gma(S) < —1.04 pt 


for y € (2.51, 2.716][2, 2.2), using dimension-reduction. Note that for both faces ad- 
jacent to the diagonal, 


max 1? = n(2.2, 2.2, 2.716)? < 2, 


so all quarters in this cell are compression-scored. 
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Calculation 17.4.3.3. Either 
IU 


b 
gma(S) + msol(S) + a@ dih(S) < +a 


or 
gma(S) < —1.04pt 
for all compression-scored quarters S(y), where a = 0.14, 
y € [2.716, 2V2][2, 2.2}°[2, 2.51] [2, 2.2], 


using dimension-reduction. 


Calculation 17.4.3.4. Either 


b 
octavor(S) + m sol(S) + @ dih(S) < ri + as 
or 
octavor(S) < —1.04 pt 


for all vor analytic-scored quarters S(y), where a = 0.14, 


y € (2:716, 2.8112, 227 (2, 2.51112, 2.27. 
Calculation 17.4.3.5. Either 


b 
gma(S) +m sol(S) +a dih(S) + 6x1 < 7+ as +88 
or 
gma(S) < —1.04pt 


for all compression-scored quarters S(y), where a = 0.054, B = 0.00455, x; = a 
and 


ye (2.81,97212. 2°72, 25112227. 


using some dimension-reduction. 
Calculation 17.4.3.6. Either 


b 
octavor(S) + msol(S) + a dih(S) + Bx, < i + as + 86 
or 
octavor(S) < —1.04 pt 


for all vor analytic-scored quarters S(y), where a = 0.054, B = —0.00455, x; = re 
and 


ye 28 Oy 212, 2212, 2311227. 
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17.4.4. Pure Voronoi Quad Clusters. Recall sol(7Tp) denotes the solid angle of the 
tetrahedron (2, 2, 2, 2, 2, 2/2). 

Define the corner cell C = [2,2+0.51/8) (2/2, 2.84]. We denote truncated Voronoi 
scoring by o. The constraint that the dividing face be acute translates into x; +x,—x6 > 0. 
In each computation we apply dimension-reduction. 

We begin with the acute case. 


Calculation 17.4.4.1. Either 


o(S) + msol(S) — b/2 < 0 


or 
sol(S) < sol(7p) 
or 
Xj +x2— xX <0 
or 


a(S) < —1.04 pt 
for y € [2, 2.51}[2.84, 4]. 


Calculation 17.4.4.2. Either 


o(S) + msol(S) — b/2 < 0 


or 
sol($) < sol(Tp) 
or 
Xj +x. — xX <0 
or 


a(S) < —1.04 pt 
for y € [2, 2.51}[2V/2, 2.84] with y ¢ C. 


Calculation 17.4.4.3. Either 


o(S)+msol($) — b/2 < 0 


or 
sol($) < sol(7o) 
or 
X1 +X2—X6 < 0, 
yec. 
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Finally, we consider the obtuse case. 


Calculation 17.4.4.4. Either 
o(S)+msol(S) — b/2 < 0 

or 

sol($) < sol(7p) 
or 

o(S) < —0.52 pt 
or 

ay <a’ 
for a symmetric pure Voronoi quad cluster composed of two copies of S, where 
S= (2,2,2,y, y,d), 


y € [4/2.51, 2.51] and d € [2,/2, 2.51,/2]. 


Calculation 17.4.4.5. Either 
o (S|) + a(S) + m(sol(S;) + sol(Sz)) —b < 0 
or 
o(S1) + o(S2) < —1.04 pt 
or 
sol($,) + sol($2) < 2 sol(To) 


for a pure Voronoi quad cluster composed of two tetrahedrons S, and S), where 
Si = (2,2, 2, vis i, 22), 

y; € [4/2.51, 2.51]. 

17.4.5. Dimension Reduction 


Calculation 17.4.5.1. | The polynomial derived for the dimension-reduction argument 
is positive for x € [A, 2.517]° and x € [4, 2.517] [4, 8]. 
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17.4.6. Second Partial Bounds. We compute all second partials d? /dx;dx; in terms 
of x;, the squares of the edge lengths. We do each computation twice, once for quasi- 
regular tetrahedrons and once for quarters. We compute the second partials of dih(-), 
sol(-), compression volume, and Voronoi volume (the vor analytic volume). Since the 
scoring functions are linear combinations of sol(-) and the volume terms, we may derive 
second partial bounds for gma(-) and vor(-) from these. 

With the application of additional computer power, these bounds could be improved. 
These bounds were computed using sixteen subdivisions. While using thirty-two sub- 
divisions would improve the bounds by a factor of 2, perhaps, the time required for the 
computations increases by a factor of 64. 


Calculation 17.4.6.1. | For quasi-regular tetrahedrons T , the second partials of dih(T) 
lie in 


[—0.0926959464, 0.0730008897]. 


Calculation 17.4.6.2._ For quarters Q, the second partials of dih(Q) lie in 
[—0.2384125007, 0.169150875]. 


Calculation 17.4.6.3. For quasi-regular tetrahedrons T , the second partials of sol(T) 
lie in 


[—0.0729140255, 0.088401996]. 


Calculation 17.4.6.4. | For quarters Q, the second partials of so\(Q) lie in 
[—0. 1040074557, 0.1384785805]. 


Calculation 17.4.6.5. For quasi-regular tetrahedrons T , the second partials of gma(T ) 
volume lie in 


[—0.0968945273, 0.0512553817]. 


Calculation 17.4.6.6. For quarters Q, the second partials of gma(Q) volume lie in 


[—0.1362100221, 0.1016538923]. 


Calculation 17.4.6.7._ For quasi-regular tetrahedrons T , the second partials of vor(T) 
volume lie in 


[—0.1856683356, 0.1350478467]. 


Calculation 17.4.6.8. For quarters Q, the second partials of vor(Q) volume lie in 


[—0.2373892383, 0.1994181009]. 
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The computed gma(-) second partials then lie in 
[—0.2119591984, 0.2828323141], 


for quasi-regular tetrahedrons and quarters. 
Likewise, the computed vor(-) second partials then lie in 


[—0.7137209962, 0.8691765157], 


for quasi-regular tetrahedrons and quarters. 
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Abstract. This paper is the sixth and final part in a series of papers devoted to the proof of 
the Kepler conjecture, which asserts that no packing of congruent balls in three dimensions 
has density greater than the face-centered cubic packing. In a previous paper in this series, 
a continuous function f on a compact space is defined, certain points in the domain are 
conjectured to give the global maxima, and the relation between this conjecture and the 
Kepler conjecture is established. In this paper we consider the set of all points in the domain 
for which the value of f is at least the conjectured maximum. To each such point, we attach 
a planar graph. It is proved that each such graph must be isomorphic to a tame graph, of 
which there are only finitely many up to isomorphism. Linear programming methods are 
then used to eliminate all possibilities, except for three special cases treated in earlier papers: 
pentahedral prisms, the face-centered cubic packing, and the hexagonal-close packing. The 
results of this paper rely on long computer calculations. 


Introduction 


This paper is the last in the series of paper devoted to the proof of the Kepler conjecture. 
The first several sections prove a result that asserts that “all contravening graphs are 
tame.” A contravening graph is one that is attached to a potential counterexample to the 
Kepler conjecture. Contravening graphs by nature are elusive and are studied by indirect 
methods. In contrast, the defining properties of tame graphs lend themselves to direct 
examination. (By definition, tame graphs are planar graphs such that the degree of every 
vertex is at least two and at most six, the length of every face is at least 3 and at most 8, 
and such that other similar explicit properties hold true.) 

It is no coincidence that contravening graphs all turn out to be tame. The definition of 
a tame graph has been tailored to suit the situation at hand. We set out to prove explicit 
properties of contravening graphs, and when we are satisfied with what we have proved, 
we brand a graph with these properties a tame graph. 
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The first section of this paper gives the definition of a tame graph. The second section 
gives the classification of all tame graphs. There are several thousand such graphs. The 
classification was carried out by computer. This classification is one of the main uses of 
a computer in the proof of the Kepler conjecture. A detailed description of the algorithm 
that is used to find all tame graphs is presented in this section. 

The third section of this paper gives a review of results from earlier parts of the paper 
that are relevant to the study of tame plane graphs. In the abridged version of the proof 
[Ha15], the results cited in this section are treated as axioms. This section thus serves as 
a guide to the results that are proved in this volume, but not in the abridged version of 
the proof. 

This section also contains a careful definition of what it means to be a contravening 
plane graph. The first approximation to the definition is that it is the combinatorial plane 
graph associated with the net of edges on the unit sphere bounding the standard regions 
of a contravening decomposition star. The precise definition is somewhat more subtle 
because we wish ensure that every face of a contravening plane graph is a simple polygon. 
To guarantee that this property holds, we simplify the net of edges on the unit sphere 
whenever necessary. 

The fourth and fifth sections of this paper contain the proof that all contravening plane 
graphs are tame. These sections complete the first half of this paper. 

The second half of this paper is about linear programming. Linear programs are used 
to prove that with the exception of three tame graphs (those attached to the face-centered 
cubic packing, the hexagonal-close packing, and the pentahedral prism), a tame graph 
cannot be a contravening graph. This result reduces the proof of the Kepler conjecture 
to a close examination of three graphs. Pentahedral prism graphs are treated in the 
fifth paper. The face-centered cubic and hexagonal-close packing graphs are treated in 
Section 8 of the third paper. The linear programming results together with these earlier 
results complete the proof of the Kepler conjecture. 

The sixth section of this paper describes how to attach a linear program to a tame 
plane graph. The output from this linear program is an upper bound on the score of 
all decomposition stars associated with the given tame plane graph. The seventh sec- 
tion of this paper shows how to use linear programs to eliminate what are called the 
aggregate tame plane graphs. The aggregates are those cases where the net of edges 
formed by the edges of standard regions was simplified to ensure that every face of 
a contravening plane graph is a polygon. By the end of this section, we have a proof 
that every standard region in a contravening decomposition star is bounded by a simple 
polygon. 

The final section of this paper gives a long list of special strategies that are used when 
the output from the linear program in the sixth section does not give conclusive results. 
The general strategy is to partition the original linear program into a collection of refined 
linear programs with the property that the score is no greater than the maximum of the 
outputs from the linear programs in the collection. These branch and bound strategies 
are described in this final section. Linear programming shows that every decomposition 
star with a tame plane graph (other than the three mentioned above) has a score less than 
that of the decomposition stars attached to the face-centered cubic packing. This and 
earlier results imply the Kepler conjecture. 
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This section defines a class of plane graphs. Graphs in this class are said to be tame. In 
the next section we give a complete classification of all tame graphs. This classification 
of tame graphs was carried out by computer and is a major step of the proof of the Kepler 
conjecture. 


18.1. Basic Definitions 


Definition 18.1. An n-cycle is a finite set C of cardinality n, together with a cyclic 
permutation s of C. We write s in the form v + s(v, C), for v € C. The element s(v, C) 
is called the successor of v (in C). A cycle is an n-cycle for some natural number n. 
By abuse of language, we often identify C with the cycle. The natural number n is the 
length of the cycle. 


Definition 18.2. Let G be a nonempty finite set of cycles (called faces) of length at 
least 3. The elements of faces are called the vertices of G. An unordered pair of vertices 
{uv, w} such that one element is the successor of the other in some face is called an edge. 
The vertices v and w are then said to be adjacent. The set G is a plane graph if four 
conditions hold: 


1. If an element v has successor w in some face F’, then there is a unique face (call 
it s’(F, v)) in G for which v is the successor of w. (Thus, v = s(w, s’(F, v)), and 
each edge occurs twice with opposite orientation.) 

2. For each vertex v, the function F +> s‘(F, v) is a cyclic permutation of the set of 
faces containing v. 

3. Euler’s formula holds relating the number of vertices V, the number of edges E, 
and the number of faces F: 


V-E+F=2. 


4. The set of vertices is connected. That is, the only nonempty set of vertices that is 
closed under v +> s(v, C) for all C is the full set of vertices. 


Remark 18.3. The set of vertices and edges of a plane graph form a planar graph in 
the usual graph-theoretic sense of admitting an embedding into the plane. Every planar 
graph carries an orientation on its faces that is inherited from an orientation of the 
plane. (Use the right-hand rule on the face, to orient it with the given outward normal 
of the oriented plane.) For us, the orientation is built into the definition, so that properly 
speaking, we should call these objects oriented plane graphs. We follow the convention 
of distinguishing between planar graphs (which admit an embedding into the plane) 
and plane graphs (for which a choice of embedding has been made). Our definition is 
more restrictive than the standard definition of plane graph in the literature, because we 
require all faces to be simple polygons with at least three vertices. Thus, a graph with 
a single edge does not comply with our narrow definition of plane graph. Other graphs 
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Pee 


Fig. 18.1. Some examples of graphs that are excluded from the narrow definition of plane graph, as defined 
in this section. 


that are excluded by this definition are shown in Fig. 18.1. Standard results about plane 
graphs can be found in any of a number of graph theory textbooks. However, this paper 
is written in such a way that it should not be necessary to consult outside graph theory 
references. 


Definition 18.4. Let /en be the length function on faces. Faces of length 3 are called 
triangles, those of length 4 are called quadrilaterals, and so forth. Let tri(v) be the number 
of triangles containing a vertex v. A face of length at least 5 is called an exceptional 
face. 


Two plane graphs are properly isomorphic if there is a bijection of vertices inducing 
a bijection of faces. For each plane graph, there is an opposite plane graph G°? obtained 
by reversing the cyclic order of vertices in each face. A plane graph G is isomorphic to 
another if G or G°? is properly isomorphic to the other. 


Definition 18.5. The degree of a vertex is the number of faces it belongs to. An n- 
circuit in G is a cycle C in the vertex-set of G, such that for every v € C, it forms an 
edge in G with its successor: that is, (v, s(v, C)) is an edge of G. 


In a plane graph G we have a combinatorial form of the Jordan curve theorem: each 
n-circuit determines a partition of G into two sets of faces. 


Definition 18.6. The type of a vertex is defined to be a triple of nonnegative integers 
(p,q,1r), where p is the number of triangles containing the vertex, g is the number of 
quadrilaterals containing it, and r is the number of exceptional faces. When r = 0, we 
abbreviate the type to the ordered pair (p, q). 


18.2. Weight Assignments 
We call the constant tgt = 14.8, which arises repeatedly in this section, the tar- 


get. (This constant arises as an approximation to 47g — 8 © 14.7947, where ¢ = 


1/(2 arctan(/2/5)).) 
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Define a: N > R by 


14.8, n=0, 1,2, 


1.4, n=3, 
COs. n=4, 
0, otherwise. 


Define b: N x N > R by b(p, g) = 14.8, except for the values in the following table 
(with tgt = 14.8): 


q 
p 0 1 2 3 4 
0 tgt tgt tgt 7.135 10.649 
1 tet tet 695 7.135 tet 
2 tet 8.5 4.756 12.981  tgt 
3 tgt 3.642 8.334 tgt tgt 
4 4.139 3.781 tgt tgt tgt 
5 0.55 11.22 tgt tgt tgt 
6 6.339 tgt tgt tgt tgt 
Define c: N > R by 
1, n=3, 
0, n=4, 
c(n) = 4 —1.03, n=5, 
—2.06, n= 6, 
—3.03, otherwise. 
Define d: N > R by 
0, n= 3, 
2.378, n=4, 
4.896, n=5, 
d(n) = +7414, n= 6, 
9.932, n=7, 
10.916, n= 8, 
tgt = 14.8, otherwise. 


A set V of vertices is called a separated set of vertices if the following four conditions 
hold: 


1. For every vertex in V there is an exceptional face containing it. 
2. No two vertices in V are adjacent. 

3. No two vertices in V lie on a common quadrilateral. 

4. Each vertex in V has degree 5. 


A weight assignment of a plane graph G is a function w: G — R taking values in 
the set of nonnegative real numbers. A weight assignment is admissible if the following 
properties hold: 


1. If the face F has length n, then w(F) > d(n). 
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2. If v has type (p, q), then 
> w(F) = b(p,q). 


F: veF 


3. Let V be any set of vertices of type (5, 0). If the cardinality of V is k < 4, then 
> w(F) > 0.55k. 


F: VAF#O 
4. Let V be any separated set of vertices. Then 


>) WF) — dden(F))) = Y/ altri). 


F: VAF#O veV 


The sum )°,, w(F) is called the total weight of w. 


18.3. Plane Graph Properties 


We say that a plane graph is tame if it satisfies the following conditions: 


. The length of each face is at least 3 and at most 8. 

. Every 3-circuit is a face or the opposite of a face. 

. Every 4-circuit surrounds one of the cases illustrated in Fig. 18.2. 

. The degree of every vertex is at least 2 and at most 6. 

. If a vertex is contained in an exceptional face, then the degree of the vertex is at 
most 5. 


6. 2 c(len(F)) > 8. 


F 

7. There exists an admissible weight assignment of total weight less than the target, 
tgt = 14.8. 

8. There are never two vertices of type (4, 0) that are adjacent to each other. 


nABWN re 


Fig. 18.2. Tame 4-circuits. 
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It follows from the definitions that the abstract vertex-edge graph of G has no loops or 
multiple joins. Also, by construction, every vertex lies in at least two faces. Property 6 
implies that the graph has at least eight triangles. 


Remark 18.7. We pause to review the strategy of the proof of the Kepler conjecture 
as described in Section 3.2. The decomposition stars that violate the main inequality 
o(D) => 8 pt are said to contravene. A plane graph is associated with each contravening 
decomposition star. These are the contravening plane graphs. The main object of this 
paper is to prove that the only two contravening graphs are Gg-c and Ghep, the graphs 
associated with the face-centered cubic and hexagonal-close packings. 

We have defined a set of plane graphs, called tame graphs. The next section will give a 
classification of tame plane graphs. (There are several thousand.) Section 20 gives a proof 
that all contravening plane graphs are tame. By the classification result, this reduces the 
possible contravening graphs to an explicit finite list. Case-by-case linear programming 
arguments will show that none of these tame plane graphs is a contravening graph (except 
Gjcc and Ghep). Having eliminated all possible graphs, we arrive at the resolution of the 
Kepler conjecture. 


19. Classification of Tame Plane Graphs 


19.1. Statement of the Theorem 


A list of several thousand plane graphs appears in [Hal6]. The following theorem is 
listed as one of the central claims in the proof in Section 3.3. 


Theorem 19.1. Every tame plane graph is isomorphic to a plane graph in this list. 


The results of this section are not needed except in the proof of Theorem 19.1. 

Computers are used to generate a list of all tame plane graphs and to check them against 
the archive of tame plane graphs. We describe a finite state machine that produces all 
tame plane graphs. This machine is not particularly efficient, and so we also include a 
description of pruning strategies that prevent a combinatorial explosion of possibilities. 


19.2. Basic Definitions 


In order to describe how all tame plane graphs are generated, we need to introduce partial 
plane graphs that encode an incompletely generated tame graph. A partial plane graph 
is itself a graph, but marked in such a way to indicate that it is in a transitional state that 
will be used to generate further plane graphs. 


Definition 19.2. A partial plane graph is a plane graph with additional data: every 
face is marked as “complete” or “incomplete.” We call a face complete or incomplete 
according to the markings. We require the following condition: 


e No two incomplete faces share an edge. 
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Each unmarked plane graph is identified with the marked plane graph in which every 
face is complete. We represent a partial plane graph graphically by deleting one face (the 
face at infinity) and drawing the others and shading those that are complete. 

A patch is a partial plane graph P with two distinguished faces F, and F2, such that 
the following hold: 


e Every vertex of P lies in F, or Fo. 

e The face Fy) is the only complete face. 

e F;, and F; share an edge. 

e Every vertex of F2 that is not in F; has degree 2. 


F, and Fy» will be referred to as the distinguished incomplete and the distinguished 
complete faces, respectively. 

Patches can be used to modify a partial plane graph as follows. Let F be an incom- 
plete face of length n in a partial plane graph G. Let P be a patch whose incomplete 
distinguished face F, has length n. Replace P with a properly isomorphic patch P’ in 
which the image of F| is equal to F°? and in which no other vertex of P’ is a vertex of 
G. Then 


G ={F'e€GUP: F AF”, F' #F} 


is a partial plane graph. Intuitively, we cut away the faces F and F; from their plane 
graphs, and glue the holes together along the boundary (Fig. 19.1). (It is immediate that 
the condition in the definition of partial plane graphs (Definition 19.2) is maintained 
by this process.) There are n distinct proper ways of identifying F; with F°? in this 
construction, and we let ¢ be this identification. The isomorphism class of G’ is uniquely 
determined by the isomorphism class of G, the isomorphism class of P, and g (ranging 
over proper bijections g: F; t+ F°?). 


19.3. A Finite State Machine 


For a fixed N we define a finite state machine as follows. The states of the finite state 
machine are isomorphism classes of partial plane graphs G with at most N vertices. The 


ay 


Fig. 19.1. Patching a plane graph. 
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transitions from one state G to another are isomorphism classes of pairs (P, g) where P 
is a patch, and g pairs an incomplete face of G with the distinguished incomplete face of 
P. However, we exclude a transition (P, ¢) at a state if the resulting partial plane graphs 
contains more than N vertices. Figure 19.1 shows two states and a transition between 
them. 

The initial states /,, of the finite state machine are defined to be the isomorphism 
classes of partial plane graphs with two faces: 


{(, 2,...,2),(m,n—1,..., D}, 


where n < N, one face is complete, and the other is incomplete. In other words, they 
are patches with exactly two faces. 

A terminal state of this finite state machine is one in which every face is com- 
plete. By construction, these are (isomorphism classes of) plane graphs with at most N 
vertices. 


Lemma 19.3. Let G be a plane graph with at most N vertices. Then its state in the 
machine is reachable from an initial state through a series of transitions. 


Proof. Pick a face in G of length n and identify it with the complete face in the initial 
state J,,. At any stage at state G’, we have an identification of all of the vertices of the plane 
graph G’ with some of the vertices of G, and an identification of all of the complete faces 
of G' with some of the faces of G (all faces of G are complete). Pick an incomplete face 
F of G’ and an oriented edge along that face. We let F’ be the complete face of G with 
that edge, with the same orientation on that edge as F’. Create a patch with distinguished 
faces F, = F°? and Fy, = F’. (F, and F) determine the patch up to isomorphism.) It is 
immediate that the conditions defining a patch are fulfilled. Continue in this way until a 
graph isomorphic to G is reached. 


Remark 19.4. It is an elementary matter to generate all patches P such that the dis- 
tinguished faces have given lengths n and m. Patching is also entirely algorithmic, and 
thus by following all paths through the finite state machine, we obtain all plane graphs 
with at most N vertices. 


19.4. Pruning Strategies 


Although we reach all graphs in this manner, it is not computationally efficient. We 
introduce pruning strategies to increase the efficiency of the search. We can terminate 
our search along a path through the finite state machine, if we can determine one of the 
following: 


1. Every terminal graph along that path violates one of the defining properties of 
tameness. 

2. Anisomorphic terminal graph will be reached by some other path that will not be 
terminated early. 
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Here are some pruning strategies of type 1 above. They are immediate consequences 
of the conditions of the defining properties of tameness. 


e If the current state contains an incomplete face of length 3, then eliminate all 
transitions, except for the transition that carries the partial plane graph to a par- 
tial plane graph that is the same in all respects, except that the face has become 
complete. 

e If the current state contains an incomplete face of length 4, then eliminate all 
transitions except those that lead to the possibilities of Section 18.3, Property 3, 
where in Property 3 each depicted face is interpreted as being complete. 

e Remove all transitions with patches whose complete face has length greater than 8. 

e Itis frequently possible to conclude from the examination of a partial plane graph 
that no matter what the terminal position, any admissible weight assignment will 
give total weight greater than the target (tgt = 14.8). In such cases, all transitions 
out of the partial plane graph can be pruned. 


To take a simple example of the last item, we observe that weights are always non- 
negative, and that the weight of a complete face of length n is at least d(n). Thus, if 
there are complete faces F;,..., Fy of lengths m;,...,,, then any admissible weight 
assignment has total weight at least ye d(n;). If this number is at least the target, then 
no transitions out of that state need be considered. 

More generally, we can apply all of the inequalities in the definition of admissible 
weight assignment to the complete portion of the partial plane graph to obtain lower 
bounds. However, we must be careful, in applying Property 4 of admissible weight 
assignments, because vertices that are not adjacent at an intermediate state may become 
adjacent in the complete graph. Also, vertices that do not lie together in a quadrilateral 
at an intermediate state may do so in the complete graph. 

Here are some pruning strategies of type 2: 


e Ata given state it is enough to fix one incomplete face and one edge of that face 
and then to follow only the transitions that patch along that face and add a complete 
face along that edge. (This is seen from the proof of Lemma 19.3.) 

e In leading out from the initial state /,,, it is enough to follow paths in which every 
added complete face has length at most n. (A graph with a face of length m, for 
m > n, will be also be found downstream from /,,.) 

e Make a list of all type (p,q) with b(p,q) < tgt = 14.8. Remove the initial 
states /; and J4, and create new initial states J, , (J is gad Bs q; etc.) in the finite state 
machine. Define the state J,,, to be one consisting of p + g + 1 faces, with p 
complete triangles and q complete quadrilaterals all meeting at a vertex (and one 
other incomplete face away from v). (If there is more than one way to arrange p 
triangles and q quadrilaterals, create states I, I, ,, 1, ,, for each possibility. See 

Fig. 19.2.) Put a linear order on states /,,,. In state transitions downstream from 

Iy,q disallow any transition that creates a vertex of type (p’, g'), for any (p’, q') 


preceding (p, qg) in the imposed linear order. 


This last pruning strategy is justified by the following lemma, which classifies vertices 
of type (p,q). 
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\Z 


Fig. 19.2. States /3,2 and J; ,. 


Lemma 19.5. Let A and B be triangular or quadrilateral faces that have at least two 
vertices in common ina tame graph. Then the faces have exactly two vertices in common, 
and an edge is shared by the two faces. 


Proof. Exercise. Some of the configurations that must be ruled out are shown in 
Fig. 19.3. Some properties that are particularly useful for the exercise are Properties 2 
and 3 of tameness, and Property 2 of admissibility. 


Once a terminal position is reached it is checked to see whether it satisfies all the 
properties of tameness. 

Duplication is removed among isomorphic terminal plane graphs. It is not an en- 
tirely trivial procedure for the computer to determine whether an isomorphism between 
two plane graphs exists. This is accomplished by computing a numerical invariant of 
a vertex that depends only on the local structure of the vertex. If two plane graphs are 
properly isomorphic then the numerical invariant is the same at vertices that correspond 
under the proper isomorphism. If two graphs have the same number of vertices with the 
same numerical invariants, they become candidates for an isomorphism. All possible 
numerical-invariant preserving bijections are attempted until a proper isomorphism is 
found, or until it is found that none exist. If there is no proper isomorphism, the same 
procedure is applied to the opposite plane graph to find any possible orientation-reversing 


isomorphism. 


Fig. 19.3. Some impossibilities. 
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This same isomorphism-producing algorithm is used to match each terminal graph 
with a graph in the archive. It is found that each terminal graph matches with one in 
the archive. (The archive was originally obtained by running the finite state machine 
and making a list of all the terminal states up to isomorphism that satisfy the given 
conditions.) 

In this way Theorem 19.1 is proved. 


20. Contravening Graphs 


We have seen that a system of points and arcs on the unit sphere can be associated with 
a decomposition star D. The points are the radial projections of the vertices of U(D) 
(those at distance at most 2f9 = 2.51 from the origin). The arcs are the radial projections 
of edges between v, w € U(D), where |v — w| < 2%. If we consider this collection of 
arcs combinatorially as a graph, then it is not always true that these arcs form a plane 
graph in the restrictive sense of Section 18. 

The purpose of this section is to show that if the original decomposition star contra- 
venes, then minor modifications can be made to the system of arcs of the graph so that 
the resulting combinatorial graph has the structure of a plane graph in the sense of Sec- 
tion 18. These plane graphs are called contravening plane graphs, or simply contravening 
graphs. 


20.1. A Review of Earlier Results 


Let¢ = 1/(2 arctan(/2/ 5)). Let sol(R) denote the solid angle of a standard region R. 
We write Tr for the following modification of op: 


tr(D) = sol(R)¢ pt — or(D) (20.1) 
and 
t(D) = ee ta(D) = 4¢ pt —o(D). (20.2) 


Since 47r¢ pt is a constant, tT and o contain the same information, but t is often more 
convenient to work with. A contravening decomposition star satisfies 


t(D) < 41¢ pt — 8 pt = (4g — 8) pt. (20.3) 


The constant (47 ¢ — 8) pt (and its upper bound tgt pt where tgt = 14.8) occurs repeatedly 
in the discussion that follows. 

Recall that a standard cluster is a pair (R, D) consisting of a decomposition star D 
and one of its standard regions R. If F is a finite set (or finite union) of standard regions, 
let 


or(D) =) or(D), — tr(D) = 9 tr(D), (20.4) 
R R 


where the sum runs over all the standard regions in F. When the sum runs over all 
standard regions, 


o(D) = > or(D), t(D) = 2D Ttr(D). (20.5) 
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Cae 
ze 


Fig. 20.1. Nonpolygonal standard regions (n(R) = 7, 7, 8, 8, 8). 


A natural number 1(R) is associated with each standard region. If the boundary of 
that region is a simple polygon, then n(R) is the number of sides. If the boundary consists 
of k disjoint simple polygons, with 11, ..., x sides then 


n(R) = ny +--+ +g + 2(k — 1). 


Lemma 20.1. Let R be a standard region in a contravening decomposition star D. The 
boundary of R is a simple polygon with at most eight edges, or one of the configurations 
of Fig. 20.1. 


Proof. This is Theorem 12.1 and Corollary 12.2. 


Lemma 20.2. Let R be a standard region. We have tr(D) > ty, where n = n(R), and 


B= 0, t4 = 0.1317, ts = 0.27113, 
te = 0.41056, t7 = 0.54999, tg = 0.6045. 


Furthermore, or(D) < Sy, for 5 <n < 8, where 
s3 = | pt, s4 = 0, 55 = —0.05704, 
56 = —0.11408, $7 = —0.17112, Sg = —0.22816. 


Proof. This is Theorem 12.1. O 


Lemma 20.3. Let F be a set of standard regions bounded by a simple polygon with at 
most nine edges. Assume that 


of(D) <s9 and tr(D)>= to, 


where so = —0.1972 and to = 0.6978. Then D does not contravene. 


Proof. This is Section 12.2. 
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Lemma 20.4. Let (R, D) be a standard cluster. If R is a triangular region, then 


oR(D) < | pt. 
If R is not a triangular region, then 
or(D) <9. 
Proof. See Lemma 8.10 and Theorem 8.4. O 


Lemma 20.5. tr(D) > 0, for all standard clusters R. 


Proof. This is Lemma 10.1. 


Recall that v has type (p, q) if every standard region with a vertex at v is a triangle 
or a quadrilateral, and if there are exactly p triangular faces and g quadrilateral faces 
that meet at v (see Definition 18.6). We write (py, g,) for the type of v. Define constants 
tip(p, g)/pt by Table 20.1. The entries marked with an asterisk will not be needed. 


Lemma 20.6. Let S,,..., 5, and R,,..., Rg be the tetrahedra and quad clusters 
around a vertex of type (p,q). Consider the constants of Table 20.1. Now, 


P q 
Yi (Si) + >> t(Ri) = t(D, g). 


Proof. This is Lemma 10.5. 


Lemma 20.7. Let vj,..., ug, for some k < 4, be distinct vertices of type (5,0). Let 
S;,..., 8, be quasi-regular tetrahedra around the edges (0, v;), fori < k. Then 


Y° t(S)) > 0.55k prt 


i=1 


Table 20.1. tLp(p, q)/pt. 


q 
Pp 0 1 2 3 4 5 6 
* * 15.18 7.135 10.6497 22.27 

1 * # 6.95 7.135 17.62 32.3 

2 # 8.5 4.756 12.9814 * x 

3 # 3.6426 8.334 —-20.9 * 

4 4.1396 3.7812 16.11 ‘ * * 

5 0.55 11.22 x * * * 

6 6.339 * # * * * 

7 14.76 “ * * * * 
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and 
‘ 


> o(S;) <r pt — 0.48k pt. 


i=l 


Proof. This is Lemma 10.6. 


Lemma 20.8. Let D be a contravening decomposition star. If the type of the vertex is 
(p, 4,1) with r = 0, then (p, q) must be one of the following: 
{(6, 0), (5, 0), (4, 0), G, 1), 4, D, G, D, @, 1), 
(3, 2), (2, 2), (1, 2), (2, 3), (1, 3), (, 3), (0, 4)}. 


Proof. This is Lemma 10.10 and Corollary 12.3. 


Lemma 20.9. A triangular standard region does not contain any enclosed vertices. 


Proof. This fact is proved in Lemma 3.7 of [Ha6]. 


Lemma 20.10. A quadrilateral region does not enclose any vertices of height at most 
2to. 


Proof. This is Lemma 10.13. 


Lemma 20.11. Let F be a union of standard regions. Suppose that the boundary of F 
consists of four edges. Suppose that the area of F is at most 21. Then there is at most 
one enclosed vertex over F. 


Proof. This is Proposition 4.2 of [Ha6]. 


Lemma 20.12. Let F be the union of two standard regions, a triangular region and a 
pentagonal region that meet at a vertex of type (1, 0, 1) as shown in Fig. 20.2. Then 


t(D) > 11.16 pt. 


Proof. This is Lemma 14.4. 


Lemma 20.13. Let R be an exceptional standard region. Suppose that R has r different 
interior angles that are pairwise nonadjacent and such that each is at most 1.32. Then 


tr(D) > t +r(1.47) pt. 


Proof. This is Remark 14.2. 
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Fig. 20.2. A 4-circuit. 


Lemma 20.14. Every interior angle of every standard region is at least 0.8638. Every 
interior angle of every standard region that is not a triangle is at least 1.153. 


Proof. CALC-208809199 and CALC-853728973-1. 


Definition 20.15. The central vertex of a flat quarter is defined to be the one that does 
not lie on the triangle formed by the origin and the diagonal. 


Lemma 20.16.  /f the interior angle at a corner v of a nontriangular standard region 
is at most 1.32, then there is a flat quarter over R whose central vertex is v. 


Proof. This is Lemma 11.30. 


20.2. Contravening Plane Graphs Defined 


A plane graph G is attached to every contravening decomposition star as follows. From 
the decomposition star D, it is possible to determine the coordinates of the set U(D) of 
vertices at distance at most 2%) from the origin. 

If we draw a geodesic arc on the unit sphere at the origin with endpoints at the 
radial projections of v; and v2 for every pair of vertices vj, v2 € U(D) such that 
|vi|, va], [v1 — v2] < 2to, we obtain a plane graph that breaks the unit sphere into 
standard regions. (The arcs do not meet except at endpoints by Lemma 4.19.) 

For a given standard region, we consider the arcs forming its boundary together with 
the arcs that are internal to the standard region. We consider the points on the unit sphere 
formed by the endpoints of the arcs, together with the radial projections to the unit sphere 
of vertices in U whose radial projection lies in the interior of the region. 


Remark 20.17. The system of arcs and vertices associated with a standard region in a 
contravening example must be a polygon, or one of the configurations of Fig. 20.1 (see 
Lemma 20.1). 
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Fig. 20.3. An aggregate forming a pentagon. 


Remark 20.18. Observe that one case of Fig. 20.1 is bounded by a triangle and a 
pentagon, and that the others are bounded by a polygon. Replacing the triangle—-pentagon 
arrangement with the bounding pentagon and replacing the others with the bounding 
polygon, we obtain a partition of the sphere into simple polygons. Each of these polygons 
is a single standard region, except in the triangle—pentagon case (Fig. 20.3), which is a 
union of two standard regions (a triangle and an eight-sided region). 


Remark 20.19. To simplify further, if we have an arrangement of six standard regions 
around a vertex formed from five triangles and one pentagon, we replace it with the 
bounding octagon (or hexagon). See Fig. 20.4. (It will be shown in Lemma 21.11 that 
there is at most one such configuration in the standard decomposition of a contraven- 
ing decomposition star, so we will not worry here about how to treat the case of two 
overlapping configurations of this sort.) 


In summary, we have a plane graph that is approximately that given by the standard 
regions of the decomposition star, but simplified to a bounding polygon when one of 
the configurations of Remarks 20.18 and 20.19 occur. We refer to the combination of 
standard regions into a single face of the graph as aggregation. We call it the plane graph 
G = G(D) attached to a contravening decomposition star D. 

Proposition 21.1 will show the vertex set U is nonempty and that the graph G(D) is 
nonempty. 

When we refer to the plane graph in this manner, we mean the combinatorial plane 
graph as opposed to the embedded metric graph on the unit sphere formed from the 
system of geodesic arcs. Given a vertex v in G(D), there is a uniquely determined vertex 
u(D) of U(D) whose radial projection to the unit sphere determines v. We call v(D) the 
corner in U(D) over v. 

By construction, the plane graphs associated with a decomposition star do not have 
loops or multiple joins. In fact, the edges of G(D) are defined by triangles whose sides 


Fig. 20.4. Degree 6 aggregates. 
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vary between lengths 2 and 2%). The angles of such a triangle are strictly less than z. 
This implies that the edges of the metric graph on the unit sphere always have an arc- 
length strictly less than z. In particular, the endpoints are never antipodal. A loop on the 
combinatorial graph corresponds to an edge on the metric graph that is a closed geodesic. 
A multiple join on the combinatorial graph corresponds on the metric graph to a pair of 
points joined by multiple minimal geodesics, that is, a pair of antipodal points on the 
sphere. By the arc-length constraints on edges in the metric graph, there are no loops or 
multiple joins in the combinatorial graph G(D). 

In Section 18.3 a plane graph satisfying a certain restrictive set of properties is said 
to be tame. If a plane graph G(D) is associated with a contravening decomposition star 
D, we call G(D) a contravening plane graph. 


Theorem 20.20. Let D be a contravening decomposition star. Then its plane graph 
G(D) is tame. 


This theorem is one of the main steps in the proof of the Kepler conjecture. It is 
advanced as one of the central claims in Section 3.3. Its proof occupies Sections 21 and 
22. In Theorem 19.1 the tame graphs are classified up to isomorphism. As a corollary, 
we have an explicit list of graphs that contains all contravening plane graphs. 


21. Contravention is tame 


This section begins the proof of Theorem 20.20 (contravening graphs are tame). To 
prove Theorem 20.20, it is enough to show that each defining property of tameness is 
satisfied for every contravening graph. This is the substance of results in the following 
sections. The proof continues through to the end of Section 22. This section verifies all 
the properties of tameness, except for the last one (weight assignments). 


21.1. First Properties 


This section verifies Properties 1, 2, 4, and 8 of tameness. First, we prove the promised 
nondegeneracy result. 


Proposition 21.1. The construction of Section 20.2 associates a (nonempty) plane 
graph with at least two faces to every decomposition star D with ao (D) > 0. 


Proof. First we show that decomposition stars with o(D) > 0 have nonempty vertex 
sets U. (Recall that U is the set of vertices of distance at most 2to from the center.) The 
vertices of U are used in Sections 4 and 5 to create all of the structural features of the 
decomposition star: quasi-regular tetrahedra, quarters, and so forth. If U is empty, the 
V-cell is a solid containing the ball B(to) of radius to, and o (D) satisfies 


o(D) = vor(D) 
—4bocrvol(VC(D)) + 420 /3 
< —46oVol(B(to)) + 4277/3 < 0. 


By hypothesis, o(D) > 0. So U is not empty. 
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Equation (20.5) shows that the function o can be expressed as a sum of terms Op 
indexed by the standard regions R. It is proved in Theorem 8.4 that or < 0, unless R isa 
triangle. Thus, a decomposition star with positive o(D) must have at least one triangle. 
Its complement contains a second standard region. Even after we form aggregates of 
distinct standard regions to form the simplified plane graph (Remarks 20.18 and 20.19), 
there certainly remain at least two faces. 


Proposition 21.2. The plane graph of a contravening decomposition star satisfies 
Property | of tameness: The length of each face is at least 3 and at most 8. 


Proof. By the construction of the graph, each face has at least three edges. The upper 
bound of eight edges is Lemma 20.1. Note that the aggregates of Remarks 20.19 and 
20.18 have between five and eight edges. 


Proposition 21.3. The plane graph of a contravening decomposition star satisfies 
Property 2 of tameness: Every 3-circuit is a face or the opposite of a face. 


Proof. The simplifications of the plane graph in Remarks 20.18 and 20.19 do not 
produce any new 3-circuits. (See the accompanying figures.) The result is Lemma 20.9. 
O 


Proposition 21.4. Contravening graphs satisfy Property 4 of tameness: The degree of 
every vertex is at least 2 and at most 6. 


Proof. The statement that degrees are at least 2 trivially follows because each vertex 
lies on at least one polygon, with two edges at that vertex. 

If the type is (p, q), then the impossibility of a vertex of degree 7 or more is found 
in Lemma 20.8. If the type is (p,q,r), with r > 1, then Lemma 20.14 shows that the 
interior angles of the standard regions cannot sum to 27: 


6(0.8638) + 1.153 > 27. 


Proposition 21.5. Contravening graphs satisfy Property 8 of tameness: There are never 
two vertices of type (4, 0) that are adjacent to each other. 


Proof. This is proved in Proposition 4.2 of [Ha6]. O 


21.2. Computer Calculations and Their Consequences 


This section continues in the proof that all contravening plane graphs are tame. The next 
few sections verify Properties 6, 5, and then 3 of tameness. 

In this section we rely on some inequalities that are not proved in this paper. Recall 
from Section 8.3 that there is an archive of hundreds of inequalities that have been proved 
by computer. This full archive appears in [Ha16]. The justification of these inequalities 
appears in the same archive. (The proofs of these inequalities were executed by computer.) 
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Each inequality carries a nine digit identifying number. To invoke an inequality, we state 
it precisely, and give its identifying number, e.g., CALC-123456789. 

To use these inequalities systematically, we combine inequalities into linear programs 
and solve the linear programs on computer. At first, our use of linear programs will be 
light, but our reliance will become progressively strong as the argument develops. 

To start out, we make use of several calculations!” that give lower bounds on Tr(D) 
when R is a triangle or a quadrilateral. To obtain lower bounds through linear program- 
ming, we take a linear relaxation. Specifically, we introduce a linear variable for each 
function Tr and a linear variable for each interior angle wr. We substitute these linear 
variables for the nonlinear functions trg(D) and nonlinear interior angle function into 
the given inequalities. Under these substitutions, the inequalities become linear. Given p 
triangles and q quadrilaterals at a vertex, we have the linear program to minimize the sum 
of the (linear variables associated with) tr(D) subject to the constraint that the (linear 
variables associated with the) angles at the vertex sum to at most d. Linear programming 
yields!*? a lower bound t,p(p, g, d) to this minimization problem. This gives a lower 
bound to the corresponding constrained sum of nonlinear functions Tp. 

Similarly, another group of inequalities'** yields upper bounds oy p(p, g, d) on the 
sum of p + q functions or, with p standard regions R that are triangular, and another 
q that are quadrilateral. These linear programs find their first application in the proof of 
the following proposition. 


21.3. Linear Programs 


To continue with the proof that contravening plane graphs are tame, we need to introduce 
more notation and methods. 
If F is a face of G(D), let 


or(D) = or (D), 


where the sum runs over the set of standard regions associated with F’. This sum reduces 
to a single term unless F is an aggregate in the sense of Remarks 20.19 and 20.18. 


Lemma 21.6. The plane graph of a contravening decomposition star satisfies Property 
6 of tameness: 


y\c(len(F)) > 8. 


F 


Proof. We will show that 
c(len(F)) pt > of(D). (21.1) 


132 The sequence of five inequalities starting with CALC-927432550, Lemma 20.5, and for quads 
CALC-310151857, CALC-655029773, CALC-73283761, CALC-15141595, CALC-574391221, CALC-396281725. 

'33 Although they are closely related, the function tp of three arguments introduced here is distinct from 
the function of two variables of the same name that is introduced in Section 20.1. 

134 CaLc-539256862, CALC-864218323, CALC-776305271, and for quads CALC-310151857, 
CALC-655029773, CALC-73283761, CALC-15141595, CALC-574391221, CALC-396281725. 


296 


Sphere Packings, VI 225 


Assuming this, the result follows for contravening stars D: 


do cdlen(F)) pt = Yl or(D) 
F 


F 
= o(D) > 8pt. 


We consider three cases for inequality (21.1). In the first case assume that the face 
F corresponds to exactly one standard region in the decomposition star. In this case 
inequality (21.1) follows directly from the bounds of Lemma 20.2: 


OF(D) < Sy < c(n) pt. 


In the second case assume the context of a pentagon F formed in Remark 20.18. 
Then, again by Lemma 20.2, we have 


or(D) < 83 + 8g < (c(3) + €(8)) pt < c(5) pt. 


(Just examine the constants c(k).) 
In the third case we consider the situation of Remark 20.19. The six standard regions 
give 


Or(D) < s5 + opp(5, 0, 27 — 1.153) < c(8) pt. 


The constant 1.153 comes from Lemma 20.14. 


Proposition 21.7. Let F be a face of a contravening plane graph G(D). Then 


tr(D) > d(len(F)) pt. 


Proof. Similar. 


Lemma 21.8. Jf v is a vertex of an exceptional standard region, and if there are six 
standard regions meeting at v, then the exceptional region is a pentagonal region and 
the other five standard regions are triangular. 


Proof. There are several cases according to the number k of triangular regions at the 
vertex. 

(k < 2) If there are at least four nontriangular regions at the vertex, then the sum 
of interior angles around the vertex is at least 4(1.153) + 2(0.8638) > 27, which is 
impossible. (See Lemma 20.14.) 

(k = 3) If there are three nontriangular regions at the vertex, then t(D) is at least 
2t4 + ts + trp(3, 0, 27 — 3(1.153)) > (4g — 8) pt. 

(k = 4) If there are two exceptional regions at the vertex, then t(D) is at least 
2ts + trp(4, 0, 27 — 2(1.153)) > (4g — 8) pt. 

If there are two nontriangular regions at the vertex, then t(D) is at least ts + 
tip(4, 1, 27 — 1.153) > (4 — 8) pt. 

(k = 5) We are left with the case of five triangular regions and one exceptional region. 
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Fig. 21.1. Nonadjacent vertices of degree 6 on a pentagon. 


When there is an exceptional standard region at a vertex of degree 6, we claim that the 
exceptional region must be a pentagon. If the region is a heptagon or more, then t(D) 
is at least t7 + tLp(5, 0, 27 — 1.153) > (4aré — 8) pt. 

If the standard region is a hexagon, then t(D) is at least f+ TLp(5, 0, 277 —1.153) > to. 
Also, 86 + orp(5, 0, 277 — 1.153) < so. The aggregate of the six standard regions is nine- 
sided. Lemma 20.3 gives the bound of 8 pt. | 


Lemma 21.9. Consider the standard regions of a contravening star D. 


1. Ifa vertex of a pentagonal standard region has degree 6, then the aggregate F of 
the six faces satisfies 


or(D) < 38g, 
Tr(D) > tg. 


2. An exceptional standard region has at most two vertices of degree 6. If there are 
two, then they are nonadjacent vertices on a pentagon, as shown in Fig. 21.1. 


Proof. We begin with the first part of the lemma. The sum Tt; (D) over these six standard 
regions is at least 

ts + trp(5, 0, 27 — 1.153) > tg. 
Similarly, 

55 + o,p(5, 0, 27 — 1.153) < sg. 


We note that there can be at most one exceptional region with a vertex of degree 6. 
Indeed, if there are two, then they must both be vertices of the same pentagon: 


tg + ts > (40e — 8) pt. 


Such a second vertex on the octagonal aggregate leads to one of the following constants 
greater than (47 ¢ — 8) pt. These same constants show that such a second vertex on a 
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hexagonal aggregate must share two triangular faces with the first vertex of degree 6. 
tg + tip(4, 0, 27 — 1.32 — 0.8638), or 
tg + 1.47 pt+ tip(4, 0, 27 — 1.153 — 0.8638), or 
tg + tip(5, 0, 27% — 1.153). 


(The relevant constants are found at Lemmas 20.13 and 20.14.) 


21.4. A Noncontravening 4-Circuit 


This subsection rules out the existence of a particular 4-circuit on a contravening plane 
graph. The interior of the circuit consists of two faces: a triangle and a pentagon. The 


circuit and its enclosed vertex are show in Fig. 20.2 with vertices marked p,,..., ps. 
The vertex p; is the enclosed vertex, the triangle is (p1, p2, ps), and the pentagon is 
(Pi,---, ps). Let vj,..., v4, Us be the corresponding vertices of U(D). 


The diagonals {vs5, v3} and {v2, v4} have length at least nf 2. by Lemma 4.19. If an 
interior angle of the quadrilateral is less than 1.32, then, by Lemma 20.16, |v; —v3| < V8. 
Thus, we assume in the following lemma that all interior angles of the quadrilateral 
aggregate are at least 1.32. 


Lemma 21.10. A decomposition star that contains this configuration does not contra- 
vene. 


Proof. Let P denote the quadrilateral aggregate of these two standard regions. By 
Lemma 20.12 we have tp(D) > 11.16 pt. There are no other exceptional faces, because 
11.16pt+ ts > (42r¢ — 8) pt. Every vertex not on P has type (5, 0), by Lemma 20.6. In 
particular, there are no quadrilateral regions. The interior angles of P are at least 1.32. 
There are at most four triangles at every vertex of P, because 


11.16 pt+ tp(5, 0, 27 — 1.32) > (4g — 8) pt. 


There are at least three triangles at every vertex of P, otherwise we contradict Lem- 
mas 20.9 or 20.11. 

The only triangulation with these properties is obtained by removing one edge from 
the icosahedron (Exercise). This implies that there are two opposite corners of P each 
having four quasi-regular tetrahedra. Since the diagonals of P have lengths greater than 
2,/2, the results of CALC-325738864 show that the union F of these eight quasi-regular 
tetrahedra satisfies 


Tr(D) => 2(1.5) pt. 
There are two additional vertices of type (5, 0) whose tetrahedra are distinct from these 


eight quasi-regular tetrahedra. They give an additional 2(0.55) pt. Now (11.16+2(1.5)+ 
2(0.55)) pt > (4¢ — 8) pt by Lemma 20.7. The result follows. 


Lemma 21.11. A contravening plane graph satisfies Property 5 of tameness: If a vertex 
is contained in an exceptional face, then the degree of the vertex is at most five. 
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Proof. An exceptional standard region with a vertex of degree 6 must be pentagonal 
by Lemma 21.9. If that pentagonal region has two or more such vertices, then by the 
same lemma it must be the arrangement shown in Fig. 21.1. This arrangement does not 
appear on a contravening graph by Lemma 21.10. 


Remark 21.12. We have now fully justified the claim made in Remark 20.19: there is 
at most one vertex on six standard regions, and it is part of an aggregate in such a way 
that it does not appear as the vertex of G(D). 


21.5. Possible 4-Circuits 


Every 4-circuit divides a plane graph into two aggregates of faces that we may call the 
interior and exterior. We call vertices of the faces in the aggregate that do not lie on the 
4-cycle enclosed vertices. Thus, every vertex lies in the 4-cycle, is enclosed over the 
interior, or is enclosed over the exterior. 

Lemma 20.11 asserts that either the interior or the exterior has at most one enclosed 
vertex. When choosing which aggregate is to be called the interior, we may make our 
choice so that the interior has area at most 277, and hence contains at most one vertex. 
With this choice, we have the following proposition. 


Proposition 21.13. Let D be a contravening plane graph. A 4-circuit surrounds one 
of the aggregates of faces shown in Property 3 of tameness. 


Proof. If there are no enclosed vertices, then the only possibilities are for it to be a 
single quadrilateral face or a pair of adjacent triangles. 

Assume there is one enclosed vertex v. If v is connected to three or four vertices of 
the quadrilateral, then that possibility is listed as part of the conclusion. 

If v is connected to two opposite vertices in the 4-cycle, then the vertex v has type 
(0, 2) and the bounds of Lemma 20.6 show that the graph cannot be contravening. 

If v is connected to two adjacent vertices in the 4-cycle, then we appeal to Lemma 21.10 
to conclude that the graph does not contravene. 

If v is connected to at most one vertex, then we appeal to Lemma 20.10. This completes 
the proof. O 


22. Weight Assignments 
The purpose of this section is to prove the existence of a good admissible weight assign- 


ment for contravening plane graphs. This will complete the proof that all contravening 
graphs are tame. 


Theorem 22.1. Every contravening plane graph has an admissible weight assignment 
of total weight less than tgt = 14.8. 
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Given a contravening decomposition star D, we define a weight assignment w by 
Fe w(F) = tr(D)/pt. 


Since D contravenes, 


dw) = Do te (D)/pt 
F 


F 

= t(D)/pt 
(4g — 8) pt/pt 
< tgt= 14.8. 


IA 


The challenge of the theorem will be to prove that w, when defined by this formula, is 
admissible. 
22.1. Admissibility 


The next three lemmas establish that this definition of w(/) for contravening plane 
graphs satisfies the first three defining properties of an admissible weight assignment. 


Lemma 22.2. Let F be a face of length n in a contravening plane graph. Define w(F) 
as above. Then w(F) > d(n). 


Proof. This is Proposition 21.7. 


Lemma 22.3. Let v be a vertex of type (p,q) in a contravening plane graph. Define 
w(F) as above. Then 


> 5 w(F) = bp, 4). 


veF 


Proof. This is Lemma 20.6. 


Lemma 22.4. Let V be any set of vertices of type (5, 0) in a contravening plane graph. 
Define w(F) as above. If the cardinality of V is k < 4, then 


be w(F) > 0.55k. 


VAF#O 
Proof. This is Lemma 20.7. O 


The following proposition establishes the final property that w(F’) must satisfy to 
make it admissible. Separated sets are defined in Section 18.2. 
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Proposition 22.5. Let V be any separated set of vertices in a contravening plane graph. 
Define w(F) as above. Then 


Y= (w(F) — d(len(F))) = Yo attri(v)), 
VAFHAD veV 


where tri(v) denotes the number of triangles containing the vertex v. 


The proof occupies the rest of this section. Since the degree of each vertex is 5, and 
there is at least one face that is not a triangle at the vertex, the only constants tri(v) that 
arise are 

tri(v) € {0,..., 4}. 
We will prove that in a contravening plane graph Conditions | and 4 of a separated set 
are incompatible with the condition tri(v) < 2, for some v € V. This will allows us to 
assume that 
tri(v) € {3, 4}, 
for all v € V. These cases are treated in Section 22.3. 

First we prove the inequality when there are no aggregates involved. Afterwards, we 

show that the conclusions can be extended to aggregate faces as well. 


22.2. Proof that tri(v) > 2 


In this subsection D is a contravening decomposition star with associated graph G(D). 
Let V be a separated set of vertices in G(D). Let v be a vertex in V such that none of its 
faces is an aggregate in the sense of Remarks 20.18 and 20.19. 


Lemma 22.6. Under these conditions, for every v € V, tri(v) > 1. 


Proof. If there are p triangles, g quadrilaterals, and r other faces, then 


t(D) = } > tr(D) 


veR 
> rts+trp(p,qg, 2m —r(1.153)). 


If there is a vertex w that is not on any of the faces containing v, then the sum of 
tr(D) over the faces containing w yields an additional 0.55 pt by Lemma 20.7. We 
calculate these constants for each (p, q, r) and find that the bound is always greater than 
(42 ¢ — 8) pt. This implies that D cannot be contravening. 


(p,9,1) Lower bound Justification 
(0, 5, 0) 22.27 pt Lemma 20.6 
(O,qg,r>1) ts +4tq © 14.41 pt 
(1, 4, 0) 17.62 pt Lemma 20.6 
(1, 3, 1) ts + 12.58 pt (tp) 

(, 2, 2) 2ts + 7.53 pt (tp) 


(1,qg,r => 3) 3ts + t4 
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Lemma 22.7. Under these same conditions, for every v € V, tri(v) > 2. 


Proof. Assume that tri(v) = 2. We will show that this implies that D does not contra- 
vene. Let e be the number of exceptional faces at v. We have e + tri(v) < 5. 

The constants 0.55 pt and 0.48 pt used throughout the proof come from Lemma 20.7. 
The constants f, comes from Lemma 20.2. 

(e = 3) First, assume that there are three exceptional faces around vertex v. They must 
all be pentagons (2ts + ft > (4 — 8) pt). The aggregate of the five faces is an m-gon 
(some m < 11). If there is a vertex not on this aggregate, use 3f5-+0.55 pt > (42 ¢ —8) pt. 
So there are at most nine triangles away from the aggregate, and 


o(D) < 9pt+ GBss + 2pt) < 8 pt. 


The argument is the same if there is a quad, a pentagon, or a hexagon (t4 + t6 = 
2ts, 84 + S86 = 255). 

(e = 2) Assume next that there are two pentagons and a quadrilateral around the 
vertex. The aggregate of the two pentagons, quadrilateral, and two triangles is an m-gon 
(some m < 10). There must be a vertex not on the aggregate of five faces, for otherwise 
we have 


o(D) < 8pt+ (255 + 2pt) < 8 pt. 


The interior angle of one of the pentagons is at most 1.32. For otherwise, 
tp(2, 1, 2% — 2(1.32)) + 2ts + 0.55 pt > (4g — 8) pt. 

Lemma 20.13 shows that any pentagon R with an interior angle less than 1.32 yields 
Tr(D) => ts + (1.47pt). If both pentagons have an interior angle < 1.32 the lemma 
follows easily from this calculation: 2(t5 + 1.47 pt) pt + tip(2, 1, 27 — 2(1.153)) + 
0.55 pt > (4¢ — 8) pt. If there is one pentagon with an angle > 1.32, we then have 
ts + (1.47 pt) + trp(2, 1, 27 — 1.153 — 1.32) + ts + 0.55 pt > (4aré — 8) pt. 

(e = 1) Assume finally that there is one exceptional face at the vertex. If it is a 
hexagon (or more), we are done: t¢ + tLp(2, 2, 27 — 1.153) > (42 — 8) pt. Assume it is 
a pentagon. The aggregate of the five faces at the vertex is bounded by an m-circuit (some 
m < 9). If there are no more than nine quasi-regular tetrahedra outside the aggregate, 
then o (D) is at most (9 — 2(0.48)) pt+-s5 + oLp(2, 2, 27 — 1.153) < 8 pt (Lemma 20.7). 
So we may assume that there are at least three vertices not on the aggregate. 

If the interior angle of the pentagon is greater than 1.32, then 


tip(2, 2, 27 — 1.32) + 3(0.55) pt + ts > (4re — 8) pt; 
if it is less than 1.32, by Lemma 20.13 


trp(2, 2, 27 — 1.153) + 3(0.55)pt + 1.47 pt+ ts > (4¢ — 8) pt. 


Lemma 22.8. The bound tri(v) > 2 holds if v is a vertex of an aggregate face. 


Proof. The exceptional region enters into the preceding two proofs in a purely formal 
way. Pentagons enter through the bounds 


ts, S5, 1.47 pt 
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and angles 1.153, 1.32. Hexagons enter through the bounds 
fo, S6 


and so forth. These bounds hold for the aggregate faces. Hence the proofs hold for 
aggregates as well. O 


22.3. Bounds when tri(v) € {3, 4} 


In this subsection D is a contravening decomposition star with associated graph G(D). 
Let V be a separated set of vertices. For every vertex v in V, we assume that none of its 
faces is an aggregate in the sense of Remarks 20.18 and 20.19. We assume that there are 
three or four triangles containing v, for every uv € V. 

To prove Inequality (4) in the definition of admissible weight assignments, we rely on 
the following reductions. Define an equivalence relation on exceptional faces by F ~ F’ 
if there is a sequence Fy = F,..., F, = F’ of exceptional faces such that consecutive 
faces share a vertex of type (3, 0, 2). (That is, tri(v) = 3.) Let F be an equivalence class 
of faces. 


Lemma 22.9. Let V be a separated set of vertices. For every equivalence class of 
exceptional faces F,, let V(F) be the subset of V whose vertices lie in the union of faces 
of F. Suppose that for every equivalence class F, Inequality (4) (in the definition of 
admissible weight assignments) holds for V(F). Then the inequality holds for V. 


Proof. By construction, each vertex in V lies in some F, for an exceptional face. 
Moreover, the separating property of V ensures that the triangles and quadrilaterals in 
the inequality are associated with a well-defined F. Thus, the inequality for V is a sum 
of the inequalities for each V(F). O 


Lemma 22.10. Let v be a vertex in a separated set V at which there are p triangles, 
q quadrilaterals, and r other faces. Suppose that for some p' < p and q' < q, we have 


tip(p’, g’, a) > (p'd(3) + q'd(4) + a(p)) pt 


for some upper bound a on the angle occupied by p’ triangles and q' quadrilaterals at 
v. Suppose further that Inequality 4 (in the definition of admissible weight assignments) 
holds for the separated set V'’ = V\{v}. Then the inequality holds for V. 


Proof. Let Fi,..., Fn, m = p'+q’, be faces corresponding to the triangles and 
quadrilaterals in the lemma. The hypotheses of the lemma imply that 


m 


Y\(wF;(D) — d(len(F;))) > a(p). 
1 


Clearly, the inequality for V is the sum of this inequality, the inequality for V’, and 
d(n) > 0. 
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Recall that the central vertex of a flat quarter is defined to be the one that does not lie 
on the triangle formed by the origin and the diagonal. 


Lemma 22.11. Let R be an exceptional standard region. Let V be a set of vertices of 
R. If v € V, let py be the number of triangular regions at v and let qy be the number of 
quadrilateral regions at v. Assume that V has the following properties: 


1. The set V is separated. 

2. Ifv € V, then there are five standard regions at v. 

3. If v € V, then the corner over v is a central vertex of a flat quarter in the cone 
over R. 

4. Ifv € V, then p, > 3. That is, at least three of the five standard regions at v are 
triangular. 

5. If R' # R is an exceptional region at v, and if R has an interior angle at least 
1.32 at v, then R' also has an interior angle at least 1.32 at v. 


Let F be the union of {R} with the set of triangular and quadrilateral regions that have 
a vertex at some v € V. Then 


tr(D) > ) (ppd (3) + qud (4) + a(po)) pt. 


veV 
Proof. Tf (pv, qv) = (3, 1) and the internal angle of R at v is at least 1.32, then we use 
tip(3, 1, 27 — 1.32) > 1.4 pt ta. 


In this case the inequality of the lemma is a consequence of this inequality and the 
inequality for V\{v}. Thus, we may assume without loss of generality that if (py, qu) = 
(3, 1), then the internal angle of R at v is at most 1.32. The conclusion now follows 
from Lemma 14.6. 


Lemma 22.12. Property 4 of admissibility holds. That is, let V be any separated set 
of vertices. Then 


D2 W(F) — dden(F))) = D7 attri). 


F: VNF#O veV 


Proof. Let V be a separated set of vertices. The results of Section 22.2 reduce the 
lemma to the case where tri(v) € {3, 4} for every vertex v € V. 
We say that there is a flat quarter centered at v, if the corner v’ over v is the central 
vertex of a flat quarter and that flat quarter lies in the cone over an exceptional region. 
One case is easy to deal with. Assume that there are three triangles, a quadrilateral, 
and an exceptional face at the vertex. Assume the interior angle on the exceptional region 
is at least 1.32; then 


tip(3, 1, 27 — 1.32) > 1.4 ptt ta. (22.1) 


This gives the bound in the sense of Lemma 22.10 at such a vertex. For the rest of the 
proof, assume that the interior angle on the exceptional region is less than 1.32 at vertices 
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of type (p, g,r) = (3, 1, 1). This implies in particular by Lemma 20.16 that there is a 
flat quarter centered at each vertex of this type. 

Let v be a vertex with no flat quarter centered at v. By Lemma 20.16, the interior 
angles of the exceptional regions at v are at least 1.32. It follows!*> that 


TLP(Pvs qu, &) > (Pvd(3) + qud(A) + a(pv)) pt. (22.2) 


Thus, by Lemma 22.10, we reduce to the case where for each v € V, there is a flat 
quarter centered at v. Assume that V has this property. 

Pick a function f from the set V to the set of exceptional standard regions as follows. 
If there is only one exceptional region at v, then let f(v) be that exceptional region. If 
there are two exceptional regions at v, then let f(v) be one of these two exceptional 
regions. Pick it to be an exceptional region with an interior angle at most 1.32 if one of 
the two exceptional regions has this property. Pick it to have a flat quarter centered at v. 
Note that by Lemma 20.16, if the exceptional region has an interior angle at most 1.32, 
then f(v) will have a flat quarter centered at v. 

For each exceptional region R, let 


Vr={veV: f(v) = R}. 
By Lemma 22.11, Property 4 of admissibility is satisfied for each Vr. Since this prop- 


erty is additive in Vr and since V is the disjoint union of the sets Vr, the proof is 
complete. 


22.4. Weight Assignments for Aggregates 
Lemma 22.13. Consider a separated set of vertices V on an aggregated face F as in 


Remark 20.18. Then inequality 4 holds (in the definition of admissible weight assign- 
ments): 


>) (w(F) - dien(F))) = DV a(ir)). 


VAF#O veV 


Proof. We may assume that tri(v) € {3, 4}. 

First consider the aggregate of Remark 20.18 of a triangle and an eight-sided re- 
gion, with pentagonal hull F’. There is no other exceptional region in a contravening 
decomposition star with this aggregate: 

tg tts > (4a¢ — 8) pt. 


A separated set of vertices V on F has cardinality at most 2. This gives the desired bound 


tg > ts + 2(1.5) pt. 
135 CaLc-551665569, CALC-824762926, and CALC-325738864. 
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Next, consider the aggregate of a hexagonal hull with an enclosed vertex. Again, there 
is no other exceptional face. If there are at most k < 2 vertices in a separated set, then 
the result follows from 


tg > t¢ + k(1.5) pt. 


There are at most three vertices in V on a hexagon, by the nonadjacency conditions 
defining V. A vertex v can be removed from V if it is not the central vertex of a flat 
quarter (Lemma 22.10 and inequalities (22.1) and (22.2)). If there is an enclosed vertex 
w, it is impossible for there to be three nonadjacent vertices, each the central vertex of 
a flat quarter: 


ED al S wl 80/8, Diy. Dig, 2) = Dig 


(E is as defined in Definition 4.14.) 
Finally consider the aggregate of a pentagonal hull with an enclosed vertex. There 
are at most k < 2 vertices in a separated set in F. There is no other exceptional region: 


t7+ts > (40¢ — 8) pt. 


The result follows from 


t7 > ts + 2(1.5) pt. 


Lemma 22.14. Consider a separated set of vertices V on an aggregate face of a 
contravening plane graph as in Remark 20.19. Inequality 4 holds in the definition of 
admissible weight assignments. 


Proof. There is at most one exceptional face in the plane graph: 
tg +ts > (4c — 8) pt. 


Assume first that an aggregate face is an octagon (Fig. 20.4). At each of the vertices of 
the face that lies on a triangular standard region in the aggregate, we can remove the 
vertex from V using Lemma 22.10 and the estimate 


tip(4, 0, 27% — 2(0.8638)) > 1.5 pt. 


This leaves at most one vertex in V, and it lies on a vertex of F which is “not aggregated,” 
so that there are five standard regions of the associated decomposition star at that vertex, 
and one of those regions is pentagonal. The value a(4) = 1.5 pt can be estimated at this 
vertex in the same way it is done for a nonaggregated case in Section 22.3. 

Now consider the case of an aggregate face that is a hexagon (Fig. 20.4). The argument 
is the same: we reduce to V containing a single vertex, and argue that this vertex can 
be treated as in Section 22.3. (Alternatively, use the fact that the pentagon-triangle 
combination in this aggregate has been eliminated by Lemma 21.10.) 


The proof that contravening plane graphs are tame is complete. 
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23. Linear Program Estimates 


We have completed a major portion of the proof of the Kepler conjecture by proving that 
every contravening plane graph is tame. 

The final portion of the proof of the Kepler conjecture consists in showing that tame 
graphs are not contravening, except for the isomorphism class of graphs isomorphic to 
Gicc and Ghep associated with the face-centered cubic and hexagonal close packings. 

This part of the proof treats all contravening tame graphs except for the three cases 
Giecs Gpent, ANd Gycp. The two cases Gye and Ghep are treated in Theorem 8.1, and the 
case Gpent iS treated in Paper V. 

The primary tool that will be used is linear programming. The linear programs are 
obtained as relaxations of the original nonlinear optimization problem of maximizing 
o (D) over all decomposition stars whose associated graph is a given tame graph G. The 
upper bounds obtained through relaxation are upper bounds to the nonlinear problem. 

To eliminate a tame graph, we must show that it is not contravening. By definition, this 
means we must show that o (D) < 8 pt. When a single linear program does not yield an 
upper bound under 8 pt, we branch into a sequence of linear programs that collectively 
imply the upper bound of 8 pt. This will call for a sequence of increasingly complex 
linear programs. 

For each of the tame plane graphs produced in Theorem 19.1, we define a linear 
programming problem whose solution dominates the value of o(D) on the set of de- 
composition stars associated with the plane graph. A description of the linear programs 
is presented in this section. 


Theorem 23.1. Jf the plane graph of a contravening decomposition star is isomorphic 
to one in the list [Ha16], then it is isomorphic to one of the following three plane graphs: 
the plane graph of the pentahedral prism, that of the hexagonal-close packing, or that 
of the face-centered cubic packing. 


This theorem is one of the central claims described in Section 3.3 that lead to the 
proof of the Kepler conjecture. 


23.1. Relaxation 


(NLP) Let f: P — R be a function on a nonempty set P. Consider the nonlinear 
maximization problem 


ae f(p). 
(LP) Consider a linear programming problem 


max C:X 


such that A x < b, where A is a matrix, b and c are vectors of real constants, and x is a 
vector of variables x = (x,,...,X,). We write the linear programming problem as 


max(c-x: Ax <b). 


308 


Sphere Packings, VI 237 


An interpretation I of a linear programming problem (LP) is a nonempty set |/], 
together with an assignment x; b> a of functions ae : |[| > R to variables x;. We say 
the constraints Ax < b of the linear program are satisfied under the interpretation J if 
for all p € |/|, 


Ax!(p) <b. 
The interpretation 7 is said to be a relaxation of the nonlinear program (NLP) if the 
following three conditions hold: 
1 P=|/|. 
2. The constraints are satisfied under the interpretation. 
3. f(p) <c-x!(p), forall p € [I]. 


Lemma 23.2. Let (LP) be a linear program with relaxation I to (NLP). Then (LP) has 
a feasible solution. Moreover, if (LP) is bounded above by a constant M, then M is an 
upper bound on the function f: |I| > R. 


Proof. A feasible solution is x; = x/(p), for any p € |1|. The rest is clear. 


Remark 23.3. In general, it is to be expected that the interpretations A x’ < b will be 
nonlinear inequalities on the domain P. In our situation, satisfaction of the constraints 
will be proved by interval arithmetic. Thus, the construction of an upper bound to (NLP) 
breaks into two tasks: to solve the linear programs and to prove the nonlinear inequalities 
required to satisfy the constraints. 


There are many nonlinear inequalities entering into our interpretation. These have 
been proved by interval arithmetic on computer and are listed at [Hal6]. 


Remark 23.4. There is a second method of establishing the satisfaction of inequalities 
under an interpretation. Suppose we wish to show that the inequality e- x < D’ is 
satisfied under the interpretation 7. Suppose that we have already established that a 
system of inequalities A x < b is satisfied under the interpretation J. We solve the linear 
programming problem max(e- x : Ax < b). If this maximum is at most b’, then the 
inequality e - x < Db’ is satisfied under the interpretation 7. We refer to e- x < b’ as an 
LP-derived inequality (with respect to the system Ax < b). 


23.2. The Linear Programs 


Let G be a tame plane graph. Let DS(G) be the space of all decomposition stars whose 
associated plane graph is isomorphic to G. 


Theorem 23.5. For every tame plane graph G other than Gjec, Gpep, and Gpent, there 
exists a finite sequence of linear programs with the following properties: 


1. Every linear program has an admissible solution and its solution is strictly less 
than 8 pt. 
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2. For every linear program in this sequence, there is an interpretation I of the linear 
program that is a relaxation of the nonlinear optimization problem 


o: |I| > R, 


where |I| is a subset of DS(G). 
3. The union of the subsets |I|, as we run over the sequence of linear programs, is 


DS(G). 


The proof is constructive. For every tame plane graph G a sequence of linear programs 
is generated by computer and solved. The optimal solutions are all bounded above by 8 pt. 
It will be clear from construction of the sequence that the union of the sets |7| exhausts 
DS(G). We estimate that nearly 10° linear programs are involved in the construction. 
The rest of this paper outlines the construction of some of these linear programs. 


Remark 23.6. Section 3.1.1 of [Hal4] shows how the linear programs that arise in 
connection with the Kepler conjecture can be formulated in such a way that they always 
have a feasible solution and so that the optimal solution is bounded. We assume that all 
our linear programs have been constructed in this way. 


Corollary 23.7. If a tame graph G is not isomorphic to Gfec, Gpcp, OF Gpent, then it is 
not contravening. 


Proof. This follows immediately from Theorem 23.5 and Lemma 23.2. 


23.3. Basic Linear Programs 


Let G be a tame plane graph. Specifically, G is one of the several thousands of graphs 
that appear in the explicit classification [Ha1l6]. 

To describe the basic linear program, we need the following indexing sets. Let VERTEX 
be the set of all vertices in G. Let FACE be the set of all faces in G. (Recall that by 
construction each face F of the graph carries an orientation.) Let ANGLE be the set of all 
angles in G, defined as the set of pairs (v, F’), where the vertex v lies in the face F’. Let 
DIRECTED be the set of directed edges. It consists of all ordered pairs (v, s(v, F')), where 
s(v, F) denotes the successor of the vertex v in the oriented face F’. Let TRIANGLES be 
the subset of FACE consisting of those faces of length 3. Let UNDIRECTED be the set of 
undirected edges. It consists of all unordered pairs {v, s(v, F)}, for v € F. 

We introduce variables indexed by these sets. Following AMPL notation, we write 
for instance y{VERTEX} to declare a collection of variables y[v] indexed by vertices v 
in VERTEX. With this in mind, we declare the variables 


Ez 


a{ANGLE}, y{VERTEX}, e{UNDIRECTED}, 
o {FACE}, T{FACE}, sol{FACE}. 


We obtain an interpretation J on the compact space DS(G). First, we define an 
interpretation at the level of indexing sets. A decomposition star determines the set 


GI 
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U(D) of vertices of height at most 2f9 from the origin of D. Each decomposition star 
D € DS(G) determines a (metric) graph with geodesic edges on the surface of the unit 
sphere, which is isomorphic to G as a (combinatorial) plane graph. There is a map from 
the vertices of G to U(D) given by v + Uv’, if the radial projection of v/ to the unit 
sphere at the origin corresponds to v under this isomorphism. Similarly, each face F 
of G corresponds to a set F’ of standard regions. Each edge e of G corresponds to a 
geodesic edge e’ on the unit sphere. 

Now we give an interpretation J to the linear-programming variables at a decompo- 
sition star D. As usual, we add a superscript J to a variable to indicate its interpretation. 
Let a[v, F]’ be the sum of the interior angles at v! of the metric graph in the standard 
regions F’. Let y[v]! be the length |v/| of the vertex v’ € U(D) corresponding to v. 
Let e[v, w]! be the length |v’ — w’| of the edge between v! and w! € U(D). Let 


o[F]’ = of(D), 
sol[F]/ = sol(F’), 
t[FV’ = t(D). 
The objective function for the optimization problems is 


max : a o[F]. 
Fe FACE 


Its interpretation under J is the score o (D). 
We can write a number of linear inequalities that will be satisfied under our interpre- 
tation. For example, we have the bounds 


0 < ylv] < 2h, v € VERTEX, 
0 < e[v, w] < 2h, (v, w) € EDGE, 

0 < alv, F] < 27, (v, F) € ANGLE, 
0 < sol[F] < 47 F € FACE. 


There are other linear relations that are suggested directly by the definitions or the 
geometry. Here, v belongs to VERTEX: 


tLF] = sol[F]¢ pt—o[F], 
2x = )° aly, FI, 
F:véF 
sol[F] = > aly, F]— (len(F) — 2)z. 
veF 


There are long lists of additional inequalities that come from interval arithmetic verifi- 
cations. Many are specifically designed to give relations between the variables. 


o[F], t[F], alv, F], 
sol[F], y[v], elv, w], 
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whenever F’ is a single standard region having three sides. Similarly, other computer 
calculations give inequalities for o[ F] and related variables, when the length of F is 4. 
A complete list of inequalities that are used for triangular and quadrilateral faces is found 
in [Hal6]. 

For exceptional faces, we have an admissible weight function w(F’). According to 
definitions w(F) = t[F]/pt, so that the inequalities for the weight function can be 
expressed in terms of the linear program variables. 

When the exceptional face is not an aggregate, then it also satisfies the inequalities 
of Lemma 20.2. 


23.4. Error Analysis 


The variables of the linear programming problem are the dihedral angles, the scores of 
each of the standard clusters, and their edge lengths. 

We subject these variables to a system of linear inequalities. First, the dihedral angles 
around each vertex sum to 27. The dihedral angles, solid angles, and score are related 
by various linear inequalities as described in Section 23.3. The solid-angle variables are 
linear functions of dihedral angles. We have 


o(D) =o5,(D) +--+ +05,(D) + or, (D) +--+ +R, (D). 


Forgetting the origin of the scores, solid angles, and dihedral angles as nonlinear functions 
of the standard clusters and treating them as formal variables subject only to the given 
linear inequalities, we obtain a linear programming bound on the score. 

Floating-point arithmetic was used freely in obtaining these bounds. The linear pro- 
gramming package CPLEX was used (see www.cplex.com). However, the results, once 
obtained, could be checked rigorously as follows.!*° 

We present an informal analysis of the floating-point errors. For each quasi-regular 
tetrahedron S; we have a nonnegative variable x; = pt — o(S;). For each quad cluster 
Rx, we have a nonnegative variable x, = —o(R,). A bound on o (D) is p pt — a Kis 
where p is the number of triangular standard regions, and J indexes the faces of the 
plane graph. We give error bounds for a linear program involving scores and dihedral 
angles. Similar estimates can be made if there are edges representing edge lengths. Let 
the dihedral angles be x;, for j in some indexing set J. Write the linear constraints as 
Ax < b. We wish to maximize c - x subject to these constraints, where c; = —1, for 
i € I,andc; = 0, for j € J. Let z be an approximate solution to the inequalities zA > c 
and z > 0 obtained by numerical methods. Replacing the negative entries of z by 0 we 
may assume that z > 0 and that zA; > c; — e, fori € J U J, and some small error e. If 
we obtain the numerical bound p pt + z- b < 7.9999 pt, and if ¢ < 107%, then o(D) is 
less than 8 pt. In fact, we note that 

(re) 
l+e 


'36 The output from each linear program that has no exceptional regions has been double checked with 
interval arithmetic. Predictably, the error bounds presented here were satisfactory (1/2002). 
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is at least c; fori € I (since c; = —1), and that it is greater than c; — ¢/(1+ ¢), fori € J 
(since c; = 0). Thus, if N < 60 is the number of vertices, and p < 2(N — 2) < 116is 
the number of triangular faces, 


o(D) < t+e-x< t+ noe Ard = 
= Ppp = PP Le iteQ" 


jE a Fae 
— + —2n 
BY ee” Lee 


ppt+z-b+e(ppt+2nN) 

(1+ e) 
4 7.9999 pt + 10-8(116 pt + 500) - 
~ 1+ 10-8 


In practice, we used 0.4429 < 0.79984 pt as our cutoff, and N < 14 in the interesting 
cases, so that much tighter error estimates are possible. 


8 pt. 


24. Elimination of Aggregates 


The proof of the following theorem occupies the entire section. It eliminates all the 
pathological cases that we have had to carry along until now. 


Theorem 24.1. Let D be a contravening decomposition star, and let G be its tame 
graph. Every face of G corresponds to exactly one standard region of D. No standard 
region of D has any enclosed vertices from U(D). (That is, a decomposition star with 
one of the aggregates shown in Fig. 20.1 is not contravening.) 


24.1. Triangle and Quad Branching 


Section 25 discusses branch and bound strategies. Branch and bound strategies replace 
a single linear program with a series of linear program, when a single linear program 
does not suffice. There is one case of branch and bound that we need before Section 25. 
This is a branching on triangular and quadrilateral faces. 

We divide triangular faces with corners v1, v2, v3 into two cases: 


e[v,, vo] + e[v2, v3] + e[v3, v1] < 6.25, 
e[v1, v2] + e[v2, v3] + e[v3, v1] = 6.25, 


whenever sufficiently good bounds are not obtained as a single linear program. We 
also divide quadrilateral faces into four cases: two flat quarters, two flat quarters with 
a diagonal running in the other direction, four upright quarters forming a quartered 
octahedron, and the mixed case. (A mixed case by definition is any case that is not one 
of the other three.) In general, if there are 7; triangles and rz quadrilaterals, we obtain as 
many as 2’'*?”? cases by breaking the various triangles and quadrilaterals into subcases. 

We break triangular faces and quadrilaterals into subcases, as needed in the linear 
programs that follow, without further comment. 


313 


242 T. C. Hales 
24.2. A Pentagonal Hull with n = 8 


The next few sections treat the nonpolygonal standard regions described in Remark 
20.18. In this subsection there is an aggregate of the octagonal region and a triangle has 
a pentagonal hull. Let P denote this aggregate. 


Lemma 24.2. Let G be acontravening plane graph with the aggregate of Remark 20.18. 
Some vertex on the pentagonal face has type not equal to (3, 0, 1). 


Proof. If every vertex on the pentagonal face has type (3, 0, 1), then at the vertex of the 
pentagon meeting the aggregated triangle, the four triangles together with the octagon 
give 
tg + ~ tp (4, 0, 27 — 2(1.153)) > (4re — 8) pt, 
(4) 


so that the graph does not contravene. 


For a general contravening plane graph with this aggregate, we have bounds 


or(D) < ptt+Ss, 


tr(D) = fg. 


We add the inequalities t[F] > tg and o[F'] < pt+ sg to the exceptional face. There 
is no other exceptional face, because tg + ts; > (42r¢ — 8) pt. We run the linear pro- 
grams for all tame graphs with the property asserted by Lemma 24.2. Every upper 
bound is less than 8 pt, so that there are no contravening decomposition stars with this 
configuration. 


24.3. n= 8, Hexagonal Hull 


We treat the two cases from Remark 20.18 that have a hexagonal hull (Fig. 20.1). One 
can be described as a hexagonal region with an enclosed vertex that has height at most 
2t) and distance at least 2% from each corner over the hexagon. The other is described 
as a hexagonal region with an enclosed vertex of height at most 2%), but this time with 
distance less than 2¢) from one of the corners over the hexagon. 

The argument for the case n = 8 with a hexagonal hull is similar to the argument of 
Section 24.2. Add the inequalities t[R] > tg and o[R] < sg for each hexagonal region. 
Run the linear programs for all tame graphs, and check that these additional inequalities 
yield linear programming bounds under 8 pt. 


24.4. n=7, Pentagonal Hull 
We treat the two cases illustrated in Fig. 20.1 that have a pentagonal hull. These cases 


require more work. One can be described as a pentagon with an enclosed vertex that has 
height at most 2/9 and distance at least 29 from each corner of the pentagon. The other 
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is described as a pentagon with an enclosed vertex of height at most 2%), but this time 
with distance less than 2f9 from one of the corners of the pentagon. 


In discussing various maps, we let v; be the corners of the regions, and we set y; = |v;| 
and yj; = |v; — v;|. The subscript F is dropped, when there is no great danger of 
ambiguity. 


Add the inequalities t[F'] > tj, o[F'] < 57 for the pentagonal face. There is no other 
exceptional region, because f5 + t7 > (4 — 8) pt. With these changes, of all the tame 
plane graphs with a pentagonal face and no other exceptional face, all but one of the 
linear programs give a bound under 8 pt. 

The plane graph Go that remains is easy to describe. It is the plane graph with eleven 
vertices, obtained by removing from an icosahedron a vertex and all five edges that meet 
at that vertex. 

We treat the case Go. Let vi2 be the vertex enclosed over the pentagon. We let 
U1,..-, Us be the five corners of U(D) over the pentagon. Break the pentagon into five 
simplices along {0, vj2}: S$; = {0, vi2, v;, v;41}. We have LP-derived bounds (in the sense 
of Remark 23.4) y[v;] < 2.168, and a[v;, F] < 2.89, fori = 1, 2,3, 4, 5. In particular, 
the pentagonal region is convex, for every contravening star D € DS(Go). 

Further LP-derived inequalities are 


o[F] > —0.2345 and t[F] < 0.644. 


By using branch and bound arguments on the triangular faces, as described in Sec- 
tion 24.1, we can improve the LP-derived inequality to 


t[F] < 0.6079. 
Another LP-derived inequality gives a bound on the perimeter: 


Yo lui = visi] < 11.407. 


Yet another LP-derived inequality states that if v;, v2, v3 are consecutive corners over 
the pentagonal region, then 


|vy — vo| + |v2 — v3| < 4.804. 


Lemma 24.3. Assume that R is a pentagonal standard region with an enclosed vertex 
v of height at most 2ty. Assume further that 


e |v;| < 2.168 for each of the five corners. 

e Each interior angle of the pentagon is at most 2.89. 

e If vj, v2, v3 are consecutive corners over the pentagonal region, then |v; — v2| + 
|v2 — v3| < 4.804. 

e >¢5 |v; — vi4i| < 11.407. 


Then og(D) < —0.2345 or tr(D) > 0.6079. 


Proof. This is Lemma 14.5. 


Since the bound tr(D) > 0.6079 contradicts the LP-derived inequality t[F] < 
0.6079, this case does not occur in a contravening graph. 
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24.5. Type (p,q,r) = (5,0, 1) 


We return briefly to the case of six standard regions around a vertex discussed in Re- 
mark 20.19. In the plane graph they are aggregated into an octagon. We take each of 
the remaining cases with an octagon, and replace the octagon with a pentagon and six 
triangles around a new vertex. There are eight ways of doing this. All eight ways in each 
of the cases gives an LP bound under 8 pt. This completes this case. 

The second aggregate shown in Fig. 20.4 contains a pentagon-triangle combination 
that was ruled out by Lemma 21.10. 


24.6. Summary 
Lemma 24.4. None of the aggregates of Remarks 20.19 and 20.18 appears in a con- 


travening star. In particular, all regions are bounded by simple polygons, and each face 
of the graph G(D) corresponds to exactly one standard region. 


Proof. The proof is the main result of this section. 


25. Branch and Bound Strategies 


When a single linear program does not give sufficiently good bounds, we apply branch 
and bound methods to improve the bound. By branching repeatedly, we are able to show 
in every case that a given tame graph is not contravening. 

By relying to a greater degree on results that appear in unpublished (but publicly 
available) computer logs, this section is more technical than the others. The purpose of 
the section is to give a sketch of the various ways that the various decomposition stars 
are divided into cases according to a branch and bound strategy. 

The first branching strategy has already been described in Section 24.1. It divides 
the decomposition stars with a given graph into subcases according to the structural 
properties of triangular and quadrilateral standard regions. 

We assume the results from Section 24 that eliminate the most unpleasant types of 
configurations. 


25.1. Review of Internal Structures 


For the past several sections, it has not been necessary to refer to the internal structure 
of the standard clusters. This section is different. To describe the branching operations, 
it is necessary to use details about the structure of standard clusters. 

Recall that a quarter is a set of four vertices with five edges of length at least 2 and 
at most 2f9 and a sixth edge of length at least 2f) and at most 2./2. The long edge of 
the quarter is called its diagonal. A set of quarters with pairwise disjoint interiors has 
been selected. Quarters in this set are said to belong to the Q-system. The Q-system has 
been constructed in such a way that if one quarter along a diagonal lies in the Q-system, 
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then all quarters along that diagonal lie in the Q-system. An anchor is a vertex of the 
packing that has distance at least 2 and at most 2%) from both endpoints of a diagonal. 
Each diagonal has a context (n, k), with n > k, where n is the number of anchors around 
the diagonal and n — k is the number of quarters that have that diagonal as an edge. If 
a diagonal has context (n, k), then k is the number of gaps that occur between anchors; 
that is, spaces that are not filled in by quarters. The context of a quarter is defined to be 
the context of its diagonal. 

Recall that a quarter (or its diagonal) is said to be upright if one endpoint of its 
diagonal is the origin. A quarter is said to be flat if it is not upright and if some vertex of 
the quarter is the origin. 

There is a process of simplification of the decomposition stars and their scoring 
functions that eliminates!*’ many of the contexts (n, k). (The upright quarters are said 
to be erased.) We assume in the following discussion and lemmas that this procedure 
has been carried out. 

An upright diagonal is said to be a loop when there is a reasonable scheme of inserting 
a simplex into each gap so that the diagonal is completely surrounded by quarters and 
the inserted simplices. The simplices that are inserted in the gaps are called anchored 
simplices. They are constructed in such a way that every edge of an anchored simplex 
has length at most 3.2. All simplices in a given loop lie over a single standard region. 
If the gaps cannot be filled with anchored simplices, the upright diagonal is not a loop. 
Details of this construction can be found in Section 11.5. 

In every case the simplices around a given upright diagonal lie in the cone over a 
single standard region. 


Lemma 25.1. Consider an upright diagonal that is a loop. Let R be the standard region 
that contains the upright diagonal and its surrounding simplices. Then the following 
contexts (n, k) are the only ones possible. Moreover, the constants that appear in the 
columns marked o and t are upper and lower bounds respectively for op(D) and Tr(D) 
when R contains one loop of that context. 


Std. region (n, k) oO T 

R quad (4,0) —0.0536 0.1362 

Rpentagon (4, 1) 5 0.27385 
(5,0) —0.157 0.3665 

R hexagon (4, 1) S6 0.41328 


(4,2) —0.1999 0.5309 
(5,1) —0.37595 0.65995 


Rheptagon (4, 1) 87 0.55271 
(4,2) —0.25694 0.67033 
R octagon (4, 1) 5g 0.60722 


(4,2) —0.31398 0.72484 


Proof. This is Lemma 13.5. 


137 Ty detail, we assume that all the contexts that do not carry a penalty have been erased. We leave loops, 
3-crowded, 4-crowded, and 3-unconfined upright diagonals unerased at this point. 
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25.2. 3-Crowded and 4-Crowded Upright Diagonals 


Definition 25.2. Consider an upright diagonal that is not a loop. Let R be the standard 
region that contains the upright diagonal and its surrounding quarters. Then the contexts 
(4, 1) and (5, 1) are the only contexts possible. In the context (4, 1), if there does not 
exist a plane through the upright diagonal such that all three quarters lie in the same half- 
space bounded by the plane, then we say that the context is 3-unconfined. If such a plane 
exists, then we say that the context is 3-crowded. We call the context (5, 1) a 4-crowded 
upright diagonal. Thus, every upright diagonal is exactly one of the following: a loop, 
3-unconfined, 3-crowded, or 4-crowded. A contravening decomposition star contains at 
most one upright diagonal that is 3-crowded or 4-crowded. See Section 11.9 for a proof 
of these facts and for further details. 


Lemma 25.3. Let R be a standard region that contains an upright diagonal that is 
4-crowded. Then 


or(D) <—0.25 and tr(D) > 0.4. 


Let R be a standard region that contains an upright diagonal that is 3-crowded. Then 


or(D) < —0.4339 and tr(D) > 0.5606. 


Proof. See Lemmas 11.11 and 11.18. 


Lemma 25.4. A contravening decomposition star does not contain any upright diag- 
onals that are 3-crowded. 


Proof. (we have an upright diagonal that is 3-crowded, then there is only one excep- 
tional region (0.5606 + ts > (42¢ — 8) pt). We add the inequalities t > 0.5606 and 
o < —0.4339 to the exceptional region. All linear programming bounds drop under 8 pt 
when these changes are made. O 


Upright diagonals that are 4-crowded require more work. We begin with a lemma. 


Lemma 25.5. Let a be the dihedral angle along the large gap along an upright di- 
agonal that is 4-crowded. Let F be the union of the four upright quarters along the 
upright diagonal. Let v, and v2 be the anchors of U(D) lying along the large gap. If 
|v1| + |v2| < 4.6, then a > 1.78 and of(D) < —0.31547. 


Proof. The bound a > 1.78 comes from the inequality archive.'** The upper bound 
on the score is a linear programming calculation involving the inequality a > 1.78 and 
the known inequalities on the score of an upright quarter. 


138 CaLc-161665083. 
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Lemma 25.6. A contravening decomposition star does not contain any upright diag- 
onals that are 4-crowded. 


Proof. Add the inequalities og(D) < —0.25 and tr(D) > 0.4 at the exceptional re- 
gions. An upright diagonal that is 4-crowded does not appear in a pentagon for purely 
geometrical reasons. Run the linear programs for all tame plane graphs with an ex- 
ceptional region that is not a pentagon. If this linear program fails to produce a bound 
of 8pt, we use the lemma to branch into two cases: either y[v;] + y[v2] > 4.6 or 
o[R] < —0.31547. In every case the bound drops below 8 pt. 


25.3. Five Anchors 


Now turn to the decomposition stars with an upright diagonal with five anchors. Five 
quarters around a common upright diagonal in a pentagonal region can certainly occur. 
We claim that any other upright diagonal with five anchors leads to a decomposition star 
that does not contravene. In fact, the only other possible context is (n, k) = (5, 1) (see 
Lemma 25.1). 


Lemma 25.7. Let D be a contravening decomposition star. Then there are no loops 
with context (5, 1) in D. 


Proof. By Lemma 25.1, the standard region R that contains the loop must be a hexagon. 
By the same lemma, we have 


Tr(D) > 0.65995 and opr(D) < —0.37595. 


Add these constraints to the linear program of the tame graphs with a hexagonal face. 
The LP-bound on o (D) with these additional inequalities is less than 8 pt. 


25.4. Penalties 


From now on, we assume that there are no loops with context (5, 1), and no 3-crowded 
or 4-crowded upright diagonals. This leaves various loops and 3-unconfined upright 
diagonals. 

At times, it is necessary to erase certain loops and 3-unconfined upright diagonals. 
There is a penalty for doing so. Let D be a decomposition star with an upright diagonal 
{0, v}. Let D’ be the decomposition star that is identical in all respects, except that v and 
all indices in the decomposition star that point to v (in the sense of Section 6.1) have 
been deleted. Let R be the standard region of D over which v is located, and let R’ be 
the corresponding standard region of D’. We say that the upright diagonal can be erased 
with penalty mp if 


or(D) < op (D') + ZR. 
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Definition 25.8. When we break a single region into smaller regions (by taking the 
part of the region that meets the cone over a quarter, anchored simplex, and so forth) 
the smaller regions will be called subregions. An anchored simplex that overlaps a flat 
quarter is said to mask the flat quarter. (Masked flat quarters are not in the Q-system.) 


Remark 25.9. A function 6 has been defined in Section 11.10. The details of the 
definition of this function are not important here. It is proved there that o is a good upper 
bound on the scoring function on flat quarters no matter what the origin of the flat quarter. 
It gives bounds for flat quarters in the Q-system, masked quarters, isolated quarters, and 
all the other types of flat quarters. The function t on the space of flat quarters is defined 
as 


t(Q) = sol(Q)é pt — c(Q). 
Remark 25.10. At times, we work with various upper bounds to or(D), say, 


or(D) < fr(D). 


When we have a specific upper bound fr(D) in view, then we will also say that the 
upright diagonal can be erased with penalty 7p if 


fr(D) < fr (D) + 7p. 


In more detail, let R = {Rj,..., Rx} be the set of subregions over the anchored simplices 
in a loop. Let fr,(D) be the approximations of the score of each anchored simplex. Let 
Q,,..., Q¢ be the flat quarters masked by the anchored simplices in the loop. Let R’ be 
the subregion of points in the union of R that are not in the cone over any Q;. Then we 
erase with penalty rp if 


Yo f(D) < D> 6 (Qj) = voreo(D) + mr. 
i £ 


If the upright diagonal is not a loop, we include in the set R all regions along the “gaps” 
around the upright diagonal. 


Sections 13.4 and 13.6 makes various estimates of the penalties that are involved 
in erasing various loops and 3-unconfined upright diagonals. Most of the penalties are 
calculated as integer combinations of the constants Ep = 0.01561, €y = 0.003521, and 
0.008. It is proved!*? in Section 11.7 that &p is the penalty for erasing a single upright 
quarter of compression type, and that &y is the penalty for erasing a single upright quarter 
of Voronoi type. 


Lemma 25.11. Let {0, v} be an upright diagonal. 


e If the upright diagonal is 3-unconfined, then the upright diagonal can be erased 
with penalty 0.008. 


139 CaLC-751772680 and CALC-310679005. 
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e If the upright diagonal is 3-unconfined and it masks a flat quarter, then the upright 
diagonal can be erased with penalty 0. 

e If a flat quarter is masked, then its diagonal has length at least 2.6. Also, if the 
diagonal of a masked flat quarter has length at most 2.7, then the height of its 
central vertex is at least 2.2. 


Proof. See Section 11.9. 


25.5. Pent and Hex Branching 


Ifa single linear program does not yield the bound o (D) < 8 pt, then we divide the set of 
decomposition stars with graph G into several subsets, according to the arrangements of 
quarters inside each standard cluster. This section gives a rough classification of possible 
arrangements of quarters in the cone over pentagonal and hexagonal standard regions. 

The possibilities are listed in Figs. 25.1 and 25.2 only up to symmetry by the dihedral 
group action on the polygon. We do not prove the completeness of the list, but its 
completeness can be seen by inspection, in view of the comments that follow here and 
in Section 25.4. Details about the size of the penalties can be found in Section 13.6. 

The conventions for generating the possibilities are different for the pentagons and 
hexagons than for the heptagons and octagons. We describe the pentagons and hexagons 
first. We erase all 3-unconfined upright diagonals. If there is one loop we leave the loop 
in the figure. If there are two loops (so that both necessarily have context (n, k) = (4, 1)), 
we erase one and keep the other. 

The figures are interpreted as follows. An internal vertex in the polygon represents 
an upright diagonal. Edges from that vertex are in 1-1 correspondence with the anchors 
around that upright diagonal. Edges between nonadjacent vertices of the polygon rep- 
resent the diagonals of flat quarters. We draw all edges from an upright diagonal to its 
anchors, and all edges of length [2t9, 2/2] that are not masked by upright quarters. Since 
the only remaining upright quarters belong to loops, the four simplices around a loop 
are anchored simplices and the edge opposite the diagonal has length at most 3.2. 

Various inequalities in the inequality archive have been designed for subregions of 
pentagons. Additional inequalities have been designed for subregions in hexagonal re- 
gions. Thus, we are able to obtain greatly improved linear programming bounds when 


IO 
Ba 


Fig. 25.1. Pentagonal face refinements. 
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fo e 
SARK 


Fig. 25.2. Hexagonal face refinements. The only figures with a penalty are the first two on the top row and 
those on the bottom row. The first two on the top row have penalties 2(0.008) and 0.008. Those on the bottom 
row have penalties 3p, 3p, &p + 2€y, and &p + 2éy. 


we break each pentagonal region into various cases, according to the list of Figs. 25.1 
and 25.2. 


25.6. Hept and Oct Branching 


When the figure is a heptagon or octagon, we proceed differently. We erase all 3- 
unconfined upright diagonals and all loops (either context (n,k) = (4, 1) or (4, 2)) 
and draw only the flat quarters. An undrawn diagonal of the polygon has length at least 
2to. Overall, in these cases much less internal structure is represented. 

In the cases where 3-confined upright diagonals or loops have been erased, a number 
indicating a penalty accompanies the diagram (Figs. 25.3 and 25.4). These penalties are 
derived in Sections 13.6 and 13.4. 

Define values 


Z(3, 1) = 0.00005 and D(3, 1) = 0.06585. 
Here are some special arguments that are used for heptagons and octagons. 
25.6.1. One Flat Quarter. Suppose that the standard region breaks into two subregions: 


the triangular region of a flat quarter Q and one other. Let n = n(R) € {7, 8}. We have 
the inequality 


or(D) < (6(Q) — Z(3, 1)) + 5n + &r + 2Ev. 


322 


Sphere Packings, VI 251 


al 
ane 


Fig. 25.3. Heptagonal face refinements. 


The penalty term > + 2&y comes from a possible anchored simplex masking a flat 
quarter. Let v be the central vertex of the flat quarter Q. Let {v;, v2} be its diagonal. 
Masked flat quarters satisfy restrictive edge constraints. It follows from Section 11.10 
that we have one of the following three possibilities: 


1. ylv] > 2.2. 
2. e[vy, v2] > 2.7. 
3. or(D) < (6(Q) — Z(3, 1)) + Sarr). 


aon 
iPS 
OS 


Fig. 25.4. Octagonal face refinements. 
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25.6.2. Two Flat Quarters. We proceed similarly if the standard region R breaks into 
three subregions: two regions R; and Rp» cut out by flat quarters Q;, Q2 and one other 
region made from what remains. Write G, for G(Q,), and so forth. It follows from 
Section 11.10 that we have one of the following three possibilities: 


1. The height of a central vertex is at least 2.2. 
2. The diagonal of a flat quarter is at least 2.7. 
3. 


or(D) < (6, — ZG, 1)) +2 — ZG, 1)) + sna, 
tr(D) > (t) — DB, D)) + ( — DB, )D) + tie. 


With heptagons, it is helpful on occasion to use an upper bound on the penalty of 
3&p = 0.04683. This bound holds if neither flat quarter is masked by a loop. For this, it 
suffices to show that the first two of the given three cases do not hold. 

If there is a loop of context (n, k) = (4, 2), we have the upper bounds of Lemma 25.1. 
If, on the other hand, there is no loop of context (n, k) = (4, 2), then we have the upper 
bound 


or(D) < (6(Q1) — ZG, 1)) + (G(Q2) — ZG, 1)) + Sacry + 2Er + 2Ev), 


where n(R) € {7, 8}. 


25.7. Branching on Upright Diagonals 


We divide the upright simplices into two domains depending on the height of the upright 
diagonal, using |v| = 2.696 as the dividing point. We break the upright diagonals (of 
unerased quarters in the Q-system) into cases: 


1. The upright diagonal has height at most 2.696. 

2. The upright diagonal {0, v} has height at least 2.696, and some anchor w along 
the flat quarter satisfies |w| > 2.45 or |v — w| > 2.45. (There is a separate case 
here for each anchor w.) 

3. The upright diagonal {0, v} has height at least 2.696, and every anchor w along 
the flat quarter satisfies |w| < 2.45 and |v — w| < 2.45. 


Many inequalities have been specially designed to hold on these smaller domains. They 
are included into the linear programming problems as appropriate. 

When all the upright quarters can be erased, then the case for upright quarters follows 
from some other case without the upright quarters. An upright quarter can be erased in 
the following situations. If the upright quarter Q has compression type (in the sense of 
Definition 7.8) and the diagonal has height at least 2.696, then!*° 


o(Q) < s-vorp(Q). 


140 CaLc-214637273. 
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(If there are masked flat quarters, they become scored by G.) If an upright quarter has 
Voronoi type and the anchors w satisfy |w| < 2.45 and |v — w| < 2.45, then the quarter 
can be erased!4! 


o(Q) < s-voro(Q). 


In general, we only have the weaker inequality! 


o(Q) < s-vorp(Q) + 0.003521. 


In a pentagon or hexagon, consider an upright diagonal with three upright quarters, 
that is, context (n, k) = (4, 1). If the upright diagonal has height at most 2.696, and if an 
upright quarter shares both faces along the upright diagonal with other upright quarters, 
then we may assume that the upright quarter has compression type. For otherwise, there 
is a face of circumradius at least /2, and hence two upright quarters of Voronoi type. 
The inequality 


octavor < octavorp —0.008, (25.1) 


if y; € [2to, 2.696], and 126 > ./2, shows that the upright quarters can be erased without 
penalty because 


&p — 0.008 — 0.008 < 0. 


If erased, the case is treated as part of a different case. 

This allows the inequalities'*? to be used that relate specifically to upright quarters of 
compression type. Furthermore, it can often be concluded that all three upright quarters 
have compression type. For this, we use various inequalities in the archive which can 
often be used to show that if the anchored simplex has a face of circumradius at least 
./2, then the linear programming bound on a (D) is less than 8 pt. 


25.8. Branching on Flat Quarters 
We make a few general remarks about flat quarters. 


Remark 25.12. Information about the internal structure of an exceptional face gives 
improvements to the constants 1.4 pt and 1.5 pt of Property 4 in the definition of admissi- 
ble weight assignments. (The bounds remain fixed at 1.4 ptand 1.5 pt, but these arguments 
allow us to specify more precisely which simplices contribute to these bounds.) These 
constants contribute to the bound on t(D) through the admissible weight assignment. 
Assume that at the vertex v there are four quasi-regular tetrahedra and an exceptional 
face, and that the exceptional face has a flat quarter with central vertex v. The calcu- 
lations of Section 22.3 show that the union F of the four quasi-regular tetrahedra and 
exceptional region give tr(D) > 1.5 pt. If there is no flat quarter with central vertex v, 
then the union F of four quasi-regular tetrahedra along {0, v} give tr(D) > 1.5 pt. We 
can make similar improvements when tri(v) = 3. 


141 CaLc-378432183. 
142 Catc-310679005. 
143 See, for example, CALC-867513567-*. 
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Remark 25.13. There are a few other interval-based inequalities that are used in par- 
ticular cases. The inequalities yj < 2.2, ya < 2.7, 034, Nase < /2 imply that the flat 
quarter has compression type (see Section 7.1). The circumradius is not a linear pro- 
gramming variable, so its upper bound must be deduced from edge-length information. 


If all three corners of a flat quarter have height at most 2.14, and if the diagonal has 
length less than 2.77, then the circumradius of the face containing the origin and diagonal 
is at most 7(2.14, 2.14, 2.77) < /2. This allows us to branch combine into three cases. 


Lemma 25.14. Let Q be a flat quarter whose corners v; have height at most 2.14 and 
whose diagonal is at most 2.77. Then one of the following is true: 


1. o(Q) =T(Q). 
2. The diagonal has length < 2.7, n(ya, ys, Yo) = afd. and 0 (Q) < s-voro(Q). 
3. The diagonal has length > 2.7 and o(Q) < s-voro(Q). 


Proof. Case 1 holds when Q is a quarter of compression type in the Q-system. If 
Q is in the Q-system but is not of compression type, then n(y4, ys, 6) > V2 and 
o(Q) < s-voro(Q). If Q is not in the Q-system, then s-vorg(Q) is an upper bound 
Lemma 11.26. If Q is not in the Q-system, then its diagonal has length at least 2.7, or the 
central vertex has height at most 2.2 (see Lemma 25.11.) In this case we use the upper 
bound s-voro(Q). O 


Various inequalities in the archive have been designed specifically for each of these 
three cases. Thus, whenever the hypotheses of the lemma are met, we are able to improve 
on the linear programming bounds by breaking into these three cases. 


25.9. Branching on Simplices that Are Not Quarters 


Lemma 25.15. Suppose that a triangular subregion comes from a simplex S with one 
vertex at the origin and three other vertices of height at most 2t9. Suppose that the edge 
lengths of the fourth, fifth, and sixth edges satisfy ys, yo € [2to,2V2], ya € [2, 2to]. 
Suppose that min(ys, yo) < 2.77. Then one of the following is true: 


1. The edges have lengths ys, yo € [2to, 2.77], Nase = af. and o (S$) < s-voro(S). 
2. 5, v6 € [2fo, 2.77], and a(S) < s-vor(S) (the analytic Voronoi function). 
3. An edge (say ye) has length yg > 2.77 and a(S) < s-voro(S). 


Proof. fwe ignore the statements about o,, then the conditions in the lemma concerning 
edge length are exhaustive. The bounds on o in each case are given by Section 9.6. O 


There are linear programming inequalities that are tailored to each case. 
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25.10. Branching on Quadrilateral Subregions 


One of the inequalities holds for a quadrilateral subregion, if certain conditions are 
satisfied. One of the conditions is y4 € [2,/2, 3.0], where y4 1s a diagonal of the subregion. 
Since this diagonal is not one of the linear programming variables, these bounds cannot 
be verified directly from the linear program. Instead we use an inequality which relates 
the desired bound y4 < 3 to the linear programming variables a[v, F'], y2, y3, ys, and ye. 


25.11. Implementation Details for Branching 


We now make a detailed examination of the internal structure of exceptional regions. 
A refinement F of a face F of a plane graph G is a set F of faces such that: 


1. The intersection of the vertex set of G with that of F is the set F. 
2. F U{F°?} is a plane graph. 


We use refinements of faces to describe the internal structure of faces. 
We introduce indexing sets FACE-F, VERTEX-F, ANGLE-F’,, EDGE- F, the sets of 
faces, vertices, angles, and edges in F,, respectively, analogous to those introduced for G. 


We create variables 2[F'], and indexed variables 


sol{FACE-F}, sc{FACE-F} tsc{FACE-F}, 


a{ANGLE-F}, y{VERTEX-F}, ef 


GI 


DGE-F}. 


“cc 


(Variables with names “y[v]” and “e[v, w]” were already created for some v,w € 
VERTEX-F M VERTEX. In these cases we use the variables already created.) 

Each vertex v in the refinement will be interpreted either as a vertex v’ € U(D), oras 
the endpoint of an upright diagonal lying over the standard region F’. We interpret the 
faces of the refinement in terms of the geometry of the decomposition star D variously as 
flat quarters, upright quarters, anchored simplices, and the other constructs of the fourth 
paper. This interpretation depends on the context, and is described in greater detail below. 

Once the interpretation of faces is fixed, the interpretations are as before for the 
variable names introduced already: y, e, a, sol. The lower and upper bounds for w and 
sol are as before. The lower and upper bounds for y[v] are 2 and 2% if v’ € U(D), but 
if (0, v’) is an upright diagonal, then the bounds are [2t, 2,/2]. The lower and upper 
bounds for e will depend on the context. 


25.12. Variables Related to Score 


The variables sc are a stand-in for the score o on a face. We do not call them o because 
the sum of these variables will not in general equal the variable o[F'], when F is a 
refinement of F: 


b sc[ F’] 4 “in| ; 


FleF 
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We use a weaker relation: 


o[F] < » sc[F’] + 2[F]. 


F'eF 


The variable z [F ] is called the penalty associated with the refinement F. (Penalties are 
discussed at length in Sections 13.6 and 13.4.) The interpretations of sc and a(F ] are 
rather involved, and are discussed on a case-by-case basis below. The interpretation of 
tsc follows from the identity 


tsc[ F’] = sol[ F’]¢ pt — sc[ F’], VF € F. 


The interpretation of variables that follows might appear to be hodge-podge at first. 
However, they are obtained in a systematic way. We analyze the proofs and approxima- 
tions in Paper IV, and define sc[F]/ as the best penalty-free scoring approximation that 
is consistent with the given face refinement. Here are the details. 

If the subregion is a flat quarter, the interpretation of sc[ F'] is the function 6, defined 
in Section 11.10. If the subregion is an upright quarter Q, the interpretation of sc[ F'] 
is the function o(Q) from Section 7. If the subregion is an anchored simplex that is 
not an upright quarter, sc[ F’] is interpreted as the analytic Voronoi function vor if the 
simplex has type C or C’, and as vor otherwise. (The types A, B, C and C’ are defined in 
Section 9.4.) Whether or not the simplex has type C, the inequality sc[ F] < 0 is satisfied. 
In fact, if vorp scoring is used, we note that there are no quoins, and g(1, fo) < 0. 

If the subregion is triangular, if no vertex represents an upright diagonal, and if the 
subregion is not a quarter, then sc[F'] is interpreted as vor or vorg depending on whether 
the simplex has type A. In either case, the inequality sc[ F’'] < voro is satisfied. 

In most other cases the interpretation of sc[ F'] is vorg. However, if R is a heptagon 
or octagon, and F has four or more sides, then sc[ F’] is interpreted as vorg except on 
simplices of type A, where it becomes the analytic Voronoi function. 

If R is a pentagon or hexagon, and F is a quadrilateral that is not adjacent to a flat 
quarter, and if there are no penalties in the region, then the interpretation of sc[ F’] is the 
actual score of the subregion over the subregion. In this case the score og has a well- 
defined meaning for the quadrilateral, because it is not possible for an upright quarter in 
the Q-system to straddle the quadrilateral region and an adjacent region. Consequently, 
any erasing that is done can be associated with the subregion without ambiguity. By the 
results of Sections 8.4 and 8.5, we have sc[F'] < 0. We also have sc[F'] < voro. 

One other bound that we have not explicitly mentioned is the bound or(D) < Sy. 
For heptagons and octagons that are not aggregates, this is a better bound than the one 
used in the definition of tameness (Property 6). In heptagons and octagons that are not 
aggregates, if we have a subregion with four or more sides, then sc[F’'] < Z(n, k) and 
tsc[F] > D(n, k). (See (13.1) and (13.2) in Section 13.5.) 

The variables are subject to a number of compatibility relations that are evident from 
the underlying definitions and geometry. 


sol[F’] = >». a[v, F’] — (len[F’] — 2)x, VF’, 


veF’ 


> alu, F'] =al[v, F], Vv. 


F': véF', F'€ FACE—F 
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Assume that a face F, € F has been interpreted as a subregion R = F/ of a standard 
region. Assume that each vertex of F is interpreted as a vertex in U (D) or as the endpoint 
of an upright diagonal over F’. One common interpretation of sc is voro,-(U(D)), 
the truncated Voronoi function. When this is the interpretation, we introduce further 
variables: 


quolv, s(v, Fi)], Vu € Fi, 
quo[s(v, F)), v], Vue Fi, 
Adih[v, Fi], Vu € Fi. 


We interpret the variables as follows. If w = s(v), and the triangle (0, v’, w’) has 
circumradius 77 at most fo, then 


quo[v, w]’ = quo(R(|v’|/2, n, to), 


quo[w, v]’ = quo(R(|w'|/2, 7, to)). 
If the circumradius is greater than fo, we take 
quo[v, wl] = quo[w, vi =0. 
The variable Adih has the following interpretation: 


A(|v"|/2)a(v', Ff), lv!| < 2h, 


Adih[v, F\]/ = 
0, otherwise. 


Under these interpretations, the following identity is satisfied: 


sc[ F;] = sol F;]v + > Adih[v, Fi] — 48cct > quo[v, w]. 


veF, 


The final sum runs over all pairs (v, w), where v = s(w, F\) or w = s(v, F)). 

For this to be useful, we need good inequalities governing the individual variables. 
Such inequalities for Adih[v, F] and quo[v,w] are found in Calculations 
CALC-815275408 and CALC-349475742. To make of use these inequalities, it is nec- 
essary to have lower and upper bounds on a[v, F'] and y[v]. We obtain such bounds as 
LP-derived inequalities in the sense of Remark 23.4. 


25.13. Appendix: Hexagonal Inequalities 


There are a number of inequalities that have been particularly designed for standard 
regions that are hexagons. This appendix describes those inequalities. They are generally 
inequalities involving more than six variables, and because of current technological 
limitations on interval arithmetic, we were not able to prove these inequalities directly 
with interval arithmetic. 

Instead we give various lemmas that deduce the inequalities from inequalities in a 
smaller number of variables (small enough to prove by interval arithmetic). 
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25.13.1. Statement of Results. There are a number of inequalities that hold in spe- 
cial situations when there is a hexagonal region. Although these inequalities do not 
appear in the main text of the proof of the Kepler conjecture, they are used in the linear 
programs. 

After stating all of them, we turn to the proofs. 


1. If there are no flat quarters and no upright quarters (so that there is a single 
subregion F’), then 


vorg < —0.212, (25.2) 
T > 0.54525. (25.3) 


2. If there is one flat quarter and no upright quarters, there is a pentagonal subregion 
F. It satisfies 


vorg < —0.221, 
T > 0.486. 


3. If there are two flat quarters and no upright quarters, there is a quadrilateral sub- 
region F’, It satisfies 


vorg < —0.168, 
tT > 0.352. 


These are twice the constants appearing in Inequality 11. 

4. If there is an edge of length between 2f and 2/2 running between two opposite 
corners of the hexagonal cluster, and if there are no flat or upright quarters on one 
side, leaving a quadrilateral region F’, then F satisfies 


vorg < —0.075, 
tT > 0.176. 


5. If the hexagonal cluster has an upright diagonal with context (4, 2), and if there 
are no flat quarters (Fig. 25.5), then the hexagonal cluster R satisfies 


Or < —0.297, 
Tr > 0.504. 


Fig. 25.5. A hexagonal cluster with context (4, 2). 
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Fig. 25.6. A hexagonal cluster with context (4, 2). 


6. If the hexagonal cluster has an upright diagonal with context (4, 2), and if there is 
one unmasked flat quarter (Fig. 25.6), let {F'} be the set of four subregions around 
the upright diagonal. (That is, take all subregions except for the flat quarter.) In 
the following inequality and Inequality 7, let a, be defined as or on quarters, and 
vor, on other anchored simplices. tp is the adapted squander function: 


of < —0.253, 


tt > 0.4686. 


7. If the hexagonal cluster has an upright diagonal with context (4, 2), and if there are 
two unmasked flat quarters (Fig. 25.7), let {F'} be the set of four subregions around 
the upright diagonal. (That is, take all subregions except for the flat quarters.) 


Yoox < 0.2, 
(4) 


yom > 0.3992. 
(4) 


8. If the hexagonal cluster has an upright diagonal in context (4, 1), and if there are 
no flat quarters, let { F'} be the set of four subregions around the upright diagonal. 
Assume that the edge opposite the upright diagonal on the anchored simplex has 
length at least 2\/2. (See Fig. 25.8.) 


voro,r(D) + }\o(Q) < —0.2187, 
(3) 


Fig. 25.7. A hexagonal cluster with context (4, 2). 
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Fig. 25.8. A hexagonal cluster with context (4, 1). 


t,n(D) +} 1(Q) > 0.518. 
(3) 


9. In this same context, let F be the pentagonal subregion along the upright diagonal. 
It satisfies 


vorg < —0.137, (25.4) 
T > 0.31. (25.5) 


10. If the hexagonal cluster has an upright diagonal in context (4, 1), and if there is 
one unmasked flat quarter, let {F'} be the set of four subregions around the upright 
diagonal. Assume that the edge opposite the upright diagonal on the anchored 
simplex has length at least 2\/2. (There are five subregions, shown in Fig. 25.9.) 


voro,2(D) + }\o(Q) < —0.1657, 
(3) 


T,2(D) + }*t(Q) > 0.384. 
(3) 


11. In this same context, let F’ be the quadrilateral subregion in Fig. 25.9. It satisfies 


vorg < —0.084, 
T > 0.176. 


25.13.2. Proof of the Inequalities 


Proposition 25.16. Inequalities 1-11 are valid. 


Fig. 25.9. A hexagonal cluster with context (4, 1). 
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We prove the inequalities in reverse order, 11-1. The bounds!* vorg < 0.009 and 
t > 0.05925 for flat quarters with diagonal \/8 are used repeatedly. Some of the proofs 
make use of tec-bounds, which are described in Section 12.9. 


Proof of Inequalities 10 and 11. Break the quadrilateral cluster into two simplices S 
and S" along the long edge of the anchored simplex S. The anchored simplex S satisfies 
t(S) => 0, o(S) < 0. The other simplex satisfies t9(S’) > 0.176 and vorg(S’) < 
—0.084 by an interval calculation.'*> This gives Inequality 11. For Inequality 10, we 
combine these bounds with the linear programming bound on the four anchored simplices 
around the upright diagonal. From a series of inequalities!*° we find that they score 
< —0.0817 and squander > 0.208. Adding these to the bounds from Inequality 11, we 
obtain Inequality 10. O 


Proof of Inequalities 8 and9. The pentagon is a union of an anchored simplex and a 
quadrilateral region. LP-bounds similar to those in the previous paragraph and based 
on the inequalities of Section 13.12 show that the loop scores at most —0.0817 and 
squanders at least 0.208. If we show that the quadrilateral satisfies 


vorg < —0.137, (25.6) 
tT > 0.31, (25.7) 


then Inequalities 8 and 9 follow. If by deformations a diagonal of the quadrilateral drops 
to 2/2, then the result follows interval calculations.'*7 By this we may now assume that 
the quadrilateral has the form 


(a1, 2, dz, 2, a3, 2, a4, bg), a2, a3 € {2, 2to}. 


If the diagonals drop under 3.2 and max(a2, a3) = 29, again the result follows from 
interval calculations.'** If the diagonals drop under 3.2 and ay = a3 = 2, then the 
result follows from further interval calculations.'° So finally we attain by deformations 
by = 2,/2 with both diagonals greater than 3.2. However, this does not exist, because 


AG, 4, 4, 3.27, 4,8, 3.2") < 0. q 


Proof of Inequalities 5—7. Inequality 7 is derived in Section 13.12. Inequalities 5 and 
6 are LP-bounds based on interval calculations. !*° 


Proof of Inequality 4. Deform as in Section 12. If at any point a diagonal of the 
quadrilateral drops to 2./2, then the result follows from interval calculations!>! and 


44 CALC-148776243. 

45 CALC-938091791. 

46 CALC-815492935, CALC-187932932, CALC-485049042, CALC-835344007. 
47 CALC-148776243, CALC-468742136. 

48 calc148776243, CALC-468742136. 

49 CALC-128523606. 

50 CALC-815492935, CALC-187932932, CALC-485049042. 

51 CALC-148776243. 
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Inequality 11: 
vorg < 0.009 — 0.084 = —0.075, 
tT > 0+0.176 = 0.176. 
Continue deformations until the quadrilateral has the form 
(a1, 2, dz, 2, a3, 2, da, ba), az, a3 € {2, 210}. 
There is necessarily a diagonal of length < 3.2, because 
A(4, 4, 3.27, 8,4, 3.27) <0. 


Suppose the diagonal between vertices v2 and v4 has length at most 3.2. If az = 2f9 
or a3 = 2fo, the result follows from interval calculations!>? and Inequality 11. Take 
dz = a3 = 2. Inequality 4 now follows from interval calculations. !>? 


Proof of Inequality 3.__ We prove that the quadrilateral satisfies 
vorp < —0.168, 
tT > 0.352. 


There are two types of quadrilaterals: (a) two flat quarters whose central vertices are 
opposite corners of the hexagon, and (b) flat quarters who share a vertex. We consider 
case (a) first. 

Case (a). We deform the quadrilateral as in Section 12.If at any point there is a 
diagonal of length at most 3.2, the result follows from Inequalities 10 and 11. Otherwise, 
the deformations give us a quadrilateral: 


(a1, 2, a2, 2to, a3, 2, a4, 2), aj € {2, 2to}. 


The tcc approximation now gives the result (see Section 12.10). 

Case (b). Label the vertices of the quadrilateral v1, ..., va, where (v1, v2) and (v1, v4) 
are the diagonals of the flat quarter. Again, we deform the quadrilateral. If at any point 
of the deformation, we find that |v; — v3| < 3.2, the result follows from Inequalities 
10 and 11. If during the deformation |v. — v4| < 2,/2, the result follows from interval 
calculations.!™ If the diagonal (v2, v4) has length at least 3.2 throughout the deformation, 
we eventually obtain a quadrilateral of the form 


(a, 2to, a, 2. a, 2, a4, 2to), ai € {2, 2to}. 
However, this does not exist: 
A(4, 4, 3.27, (2t9)”, (2t9)*, 3.27) < 0. 


We may assume that |v2 — v4] € (2/2, 3.2]. The result now follows from interval 
calculations.!>> 


152 CaLc-148776243. 
153 CaLC-128523606. 
154 CaLC-148776243, CALC-3 15678695. 
155 CALC-315678695. 
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Proof of Inequality 2. This case requires more effort. We show that 


vorp < —0.221, 
tT > 0.486. 


Label the corners (v1, ..., vs) cyclically with (v,, vs) the diagonal of the flat quarter in 
the hexagonal cluster. We use the deformation theory of Section 12. The proof appears 
in steps (1)-(6). 

(1) If during the deformations, |v; — v4| < 3.2 or |v2 — us| < 3.2, the result follows 
from the Inequalities in Section 25.13.2 and 11. We may assume this does not occur. 

(2) If an edge (v1, v3), (v2, v4), Or (V3, Us) drops to 2./2, continue with deformations 
that do not further decrease this diagonal. If |v; — v3| = |v3 — vs| = Qn/ 2, then the result 
follows from interval calculations.!*° 

If we have |v; — v3| = 2/2, deform the figure to the form 


(a1, 2, a2, 2, a3, 2, a4, 2, as, 210), a2, a4, a5 € {2, 2to}. 


Once it is in this form, break the flat quarter (0, v;, v2, v3) from the cluster and deform 
v3 until a3 € {2, 2p}. The result follows from an interval calculation. !>7 

We handle a boundary case of the preceding calculation separately. After breaking 
the flat quarter off, we have the cluster 


(a1, 2V2, a3, 2, a4, 2, as, 2t0), a3, a4, As € {2, 2to}. 


If |v; — v4| = 3.2, we break the quadrilateral cluster into two pieces along this diagonal 
and use interval calculations'** to conclude the result. This completes the analysis of the 
case |v) — v3| = 2/2. 

(3) If |v2 — v4| < 3.2, then deform until the cluster has the form 


(a1, 2, a2, 2, a3, 2, a4, 2, a5, 2t0), a1, 43, a5 € {2, 2to}. 


Then cut along the special simplex to produce a quadrilateral. Disregarding cases already 
treated by the interval calculations,!°? we can deform it to 


(a1, 2, a2, 202, a4, 2,45, 2t), ai € {2, 2to}, 


with diagonals at least 3.2. The result now follows from interval calculations.!© 


In summary of (1)-(3), we find that by disregarding cases already considered, we 
may deform the cluster into the form 


(a1, 2, dy, 2, a3, 2, a4, 2, ds, 29), aj € {2, 2to}, 


Vy — v3| > 2/2, |v3 — Us| > 2/2, |vz — v4| > 3.2. 


156 CALC-148776243, CALC-673399623. 
1ST CALC-297256991. 
158 CaLc-861511432. 
159 CaLc-861511432. 
160 CAaLC-746445726. 
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(4) Assume |v; — v3], [v3 — Us| < 3.2. If max(a,, a3, a5) = 2f9, we invoke interval 
calculations!*! to prove the inequalities. So we may assume a, = a3 = as = 2. The result 
now follows from interval calculations.!®* This completes the case |v; — v3], |v3 — us| < 
3.2. 

(5) Assume |v; — v3], |v3 — us| > 3.2. We deform to 


(a1, 2, a2, 2, a3, 2, a4, 2, a5, 20), aj € {2, 210}. 


If az = 2%) and a; = a3 = 2, then the simplex does not exist by Section 13.7. Similarly, 
a4 = 20, ds = a3 = 2 does not exist. The tcc bound gives the result except when 
ay = a4 = 2. The condition |vz — v4| > 3.2 forces az = 2. These remaining cases are 
treated with interval calculations. !® 

(6) Assume |v, — v3| < 3.2 and |v3 — vs| > 3.2. This case follows from deformations 
and interval calculations.'® This completes the proof of Inequality 2. O 


Proof of Inequality 1. Label the corners of the hexagon v,,..., v6. The proof to this 
inequality is similar to the other cases. We deform the cluster by the method of Section 12 
until it breaks into pieces that are small enough to be estimated by interval calculations. 
If a diagonal between opposite corners has length at most 3.2, then the hexagon breaks 
into two quadrilaterals and the result follows from Inequality 25.13.2. 

If a flat quarter is formed during the course of deformation, then the result follows 
from Inequality 2 and interval calculations.'®° Deform until the hexagon has the form 


(a1, 2, do, 2,..., dg, 2), a; € {2, 2to}. 


We may also assume that the hexagon is convex (see Section 12.12). 

If there are no special simplices, we consider the tcc-bound. The tcc-bound implies 
Inequality 1, except when a; = 2, for all i. However, if this occurs, the perimeter of the 
convex spherical polygon is 6 arc(2, 2, 2) = 27. Thus, there is a pair of antipodal points 
on the hexagon. The hexagon degenerates to a lune with vertices at the antipodal points. 
This means that some of the angles of the hexagon are zr. One of the tcc’s has the form 
C(2, 1.6, 7), in the notation of Section 12.10. With this extra bit of information, the tcc 
bound implies Inequality 1. 

If there is one special simplex, say |v; — v;| € [2./2, 3.2], we remove it. The score 


of the special simplex is! 

vorg < 0, T > 0.05925, if max(|v1], |vs|) = 2¢0, 

vorg < 0.0461, 1} > 0, if |uy| = |vs| = 2, 
The resulting pentagon can be deformed. If by deformations, we obtain |v2 — us| = 
3.2 or |v; — v4| = 3.2, the result follows from Inequalities 25.13.2 and two interval 
calculations.!° 


161 CaLC-148776243, CALC-297256991. 
162 CaLc-897046482. 
163 CALC-928952883. 
164 CaLC-297256991, CALC-673800906. 
165 CaLc-148776243. 
166 CaLc-148776243. 
167 CaLC-725257062, CALC-977272202. 
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If |v; — vj| = 2V2, we use Inequality 2 and interval calculations'® unless |v;| = 
|vs| = 2. If |u| = |vs| = 2, we use interval calculations.!® If a second special simplex 
forms during the deformations, the result follows from interval calculations.!” 

The final case of Inequality | to consider is that of two special simplices. We divide 
this into two cases: (a) The central vertices of the specials are vz and v6. (b) The central 
vertices are opposite v; and v4. In case (a) the result follows by deformations and interval 
calculations.'7! In case (b) the result follows by deformations and interval calculations.!7” 
This completes the proof of Inequality 1 and the proof of the proposition. 


25.14. Conclusion 


By combinations of branching along the lines set forth in the preceding sections, a 
sequence of linear programs is obtained that establishes that o (D) is less than 8 pt. For 
details of particular cases, the interested reader can consult the log files in [Hal6], which 
record which branches are followed for any given tame graph. (For most tame graphs, a 
single linear program suffices.) 

This completes the proof of the Kepler conjecture. 
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A Revision to the Proof of the Kepler Conjecture 
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Introduction to Part III 


Part II presents a 2010 follow-up paper of T. C. Hales, J. Harrison, S$. McLaugh- 
lin, T. Nipkow, S. Obua, and R. Zumkeller, “A Revision of the Proof of the Kepler 
Conjecture” (Discrete Comput. Geom., 44 (2010), 1-34). 


This paper has three objects. First, it describes an ongoing project to give a formal 
proof of the Kepler Conjecture. A formal proof of the Kepler Conjecture will provide 
a definitive version of the proof, checkable entirely by computer within a formal 
language. Second, the initial process of formalization of the original proof uncovered 
a gap in its detailed logic, and this paper supplies details to fill this gap. Third, 
this paper lists and corrects errata in the original Hales-Ferguson proof in Discrete 
Comput. Geom. 
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Introduction 


In 2006, Discrete and Computational Geometry devoted an issue to the proof of the 
Kepler conjecture on sphere packings, which asserts that no packing of congruent 
balls in three-dimensional Euclidean space can have density greater than that of the 
face-centered cubic packing [6, 14-16, 23]. 

The proof is long and complex. The editors’ forward to that issue remarks that 
“the reviewing of these papers was a particularly enormous and daunting task.” “The 
main portion of the reviewing took place in a seminar run at Eétvos University over a 
3 year period. Some computer experiments were done in a detailed check. The nature 
of this proof, consisting in part of a large number of inequalities having little internal 
structure, and a complicated proof tree, makes it hard for humans to check every step 
reliably. Detailed checking of specific assertions found them to be essentially correct 
in every case tested. The reviewing process produced in the reviewers a strong degree 
of conviction of the essential correctness of the proof approach, and that the reduction 
method led to nonlinear programming problems of tractable size.” 

The process of review and revision did not end when the proof was published. This 
article summarizes the current status of a long-term initiative to reorganize the origi- 
nal proof into a more transparent form and to provide a greater level of certification 
of the correctness of the computer code and other details of the proof. 

The article contains two parts. The first part describes an initiative to give a formal 
proof of the Kepler conjecture. The second part gives errata in the original proof of 
the Kepler conjecture. Most of these errata are minor. However, a significant new 
argument appears in a separate section (Sect. 8). It finishes an incomplete argument 
in the original proof asserting that there is no loss in generality in assuming (for 
purposes of the main estimate) that subregions are simple polygons. The incomplete 
argument was detected during the preparation of the blueprint edition of the proof, 
which is described in Sect. 2. 

In this article, the original proof refers to the proof published in [23]. S. Ferguson 
and T. Hales take full responsibility for every possible error in the original proof of 
the Kepler conjecture. Over the past decade, many have contributed significantly to 
making that proof more reliable. 


Part 1. Formal Proof Initiative 

1 The Flyspeck Project 

The purpose of a long-term project, called the Flyspeck project, is to give a formal 
proof of the Kepler conjecture. This section makes some preliminary remarks about 
formal proofs and gives a general overview of the current status of this project. 


1.1 Formal Proof 


A formal proof is a proof in which every logical inference has been checked, all the 
way back to the foundational axioms of mathematics. No step is skipped no matter 
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how obvious it may be to a mathematician. A formal proof may be less intuitive, and 
yet is less susceptible to logical errors. Because of the large number of inferences 
involved, a computer is used to check the steps of a formal proof. 

It is a large labor-intensive undertaking to transform a traditional proof into a for- 
mal proof. The first stage is to expand the traditional proof in greater detail. This stage 
fills in steps that a mathematician would regard as obvious, works out arguments that 
the original proof leaves to the reader, and supplies the assumed background know]- 
edge. In a final stage, the detailed text is transcribed into a computer-readable format 
inside a computer proof assistant. The proof assistant contains mathematical axioms, 
logical rules of inference, and a collection of previously proved theorems. It validates 
each new lemma by stepping through each inference. No other currently available 
technology is able to provide levels of certification of a complex mathematical proof 
that is remotely comparable to that available by formal computer verification. A gen- 
eral overview of formal proofs can be found at [8, 22, 28]. 

Proof assistants differ in detail in the way they treat the formalization of a theorem 
that is itself a computer verification (such as the proof of the four-color theorem or 
the proof of the Kepler conjecture). In general, a formal proof of a computer ver- 
ification can be viewed as a formal proof of the correctness of the computer code 
used in the verification. That is, the formal proof certifies that the code is a bug-free 
implementation of its specification. 


1.2 Formal Proof of the Kepler Conjecture 


As mentioned above, the purpose of the Flyspeck project is to give a formal proof of 
the Kepler conjecture. (The project name Flyspeck comes from the acronym FPK, for 
the Formal Proof of the Kepler conjecture.) This is the most complex formal proof 
ever undertaken. We estimate that it may take about twenty work-years to complete 
this formalization project. 

The Flyspeck project is introduced in the article [13]. The project page gives the 
latest developments [18]. The project is now at an advanced stage; in fact, we estimate 
that the project is now about half-way complete. One of the main purposes of this 
article is to present a summary of the current status of this project. 

In the original proof of the Kepler conjecture, there was a long mathematical text 
and three major pieces of computer code. The written part of the proof has been sub- 
stantially revised with aims of the Flyspeck project in mind. Section 2 compares this 
revised text with the original. There is now a good match between the mathemati- 
cal background assumed in the text and the mathematical material that is available 
in the proof assistant HOL Light. Section 3 describes the current level of support 
in HOL Light for the formalization of Euclidean space and measure theory. In the 
years following the publication of the original proof, S. McLaughlin has reworked 
and largely rewritten the entire body of code in a form that is more transparent and 
more amenable to formalization. Section 4 points out some difficulties in verifying 
the computer code in its original form and documents the reimplementation. 

There have been three Ph.D. theses written on the Flyspeck project, one devoted 
to each of the three major pieces of computer code. The first piece of computer code 
uses interval arithmetic to verify nonlinear inequalities. R. Zumkeller’s thesis [49] 
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develops nonlinear inequality proving inside the proof assistant Coq. Section 5 gives 
an example of this work. The second piece of computer code enumerates all tame 
graphs. (The definition of tameness is rather intricate; its key property is that the set 
of tame graphs includes all graphs that give a potential counterexample to the conjec- 
ture.) G. Bauer’s thesis, together with subsequent work with T. Nipkow, completes 
the formal proof of the enumeration of tame graphs [38]. Section 6 gives a sum- 
mary of this formalization project. The third piece of computer code generates and 
runs some 10° linear programs. These linear programs show that none of the poten- 
tial counterexamples to the Kepler conjecture are actual counterexamples. S. Obua’s 
thesis develops the technology to generate and verify the linear programs inside the 
proof assistant Isabelle [39]. Section 7 describes this research. 

The ultimate aim is to develop a complete formal proof of the Kepler conjecture 
within a single proof assistant. Because of the scope of the problem and the number 
of researchers involved, different proof assistants have been used for different parts 
of the proof: HOL Light for background in Euclidean geometry and the text, Coq for 
nonlinear inequality verification, and Isabelle/HOL for graph enumeration and linear 
programming. This raises the issue of how to translate a formal proof automatically 
from one proof assistant to another. Implementations of automated translation among 
the proof assistants HOL, Isabelle, and Coq can be found at [4, 33, 40, 47]. 


2 Blueprint Edition of the Kepler Conjecture 


The blueprint edition of the proof of the Kepler conjecture is a second-generation 
proof that contains far more explicit detail than the original proof. The blueprint 
edition is available at [20, 21]. Many proofs have been significantly simplified and 
systematized. It has been written in a manner to permit easy formalization. As its 
name might suggest, this version is intended as a blueprint for the construction of a 
formal proof. This section compares the blueprint edition with the original. 


2.1 Lemmas in Elementary Geometry 


A collection of about 200 lemmas that can be expressed in elementary terms has been 
extracted from the original proof and placed in a separate collection [21]. This has 
several advantages. First of all, these lemmas, although elementary, are precisely the 
parts of the original proof that put the greatest burden on the reader’s geometrical 
intuition. (Many of these lemmas deal with the existence or nonexistence of con- 
figurations of several points in R? subject to various metric constraints.) Also, the 
lemmas can be stated without reference to the Kepler conjecture and all of the ma- 
chinery that has been introduced to give a proof. Finally, the proofs of these lemmas 
rely on similar methods and are best considered together [17]. Section 3 on Enhanced 
Automation gives an approach to proving the lemmas in this collection. 

These lemmas can be expressed in the first-order language of the real numbers; 
that is, they can be expressed in the syntax of first-order logic with equality (allowing 
quantifiers Vx and 4x with variables running over the real numbers), the real constants 
0, 1, and ring operations (+), (—), (-) on R. In fact, L. Fejes T6th’s statement of 
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the Kepler conjecture as an optimization problem in a finite number of variables 
can itself be expressed in the first-order language of the real numbers [5]. (For this, 
the truncation used by L. Fejes Toth must be modified slightly so that the truncated 
Voronoi cells are polyhedra.) Thus, it should come as no surprise that many of the 
intermediate lemmas in the proof can also be expressed in this manner. 

For example, consider the statement asserting the existence of a circumcenter of a 
triangle: if three points in the plane are not collinear, then there exists a point in the 
plane that is equidistant from all three. This can be expressed in elementary terms 
as follows: for every (x1, y1), (%2, y2), (%3, y3), if there do not exist f), 2, and f3 for 
which f; (x1, y1) + f2(%2, y2) + £3(%3, y3) = (0,0) and t; + % + = 1, then there 
exists (x, y) such that 


@— m+ Q-m) = 2) + (y— 2)? = @ — 3) + — 93)”. 
2.2 Background Material 


There is now a close match between what the blueprint edition assumes as back- 
ground and what the proof assistant HOL Light provides, as described in Sect. 3. The 
blueprint edition develops substantial background material in trigonometry, measure 
and integration, hypermaps, and fans (a geometric realization of a hypermap). It turns 
out that only a small number of primitive volumes need to be computed for the proof 
of the Kepler conjecture. These primitives include the volume of a ball, a tetrahedron, 
and right circular cone. No line or surface integrals are required. 

A hypermap consists of three permutations e,n, f (ona finite set D) that compose 
to the identity eono f =T. A hypermap is the combinatorial structure used by 
Gonthier in his formal proof of the four-color theorem. In 2005, the proof of the 
Kepler conjecture was rewritten in terms of hypermaps, because it is better suited for 
formal proofs than planar graphs. 

The Jordan curve theorem (JCT) has been formalized as a step in the Flyspeck 
project [19]. In fact, something weaker than the JCT suffices. The project only uses 
the JCT for curves on the surface of a unit sphere consisting of a finite number of arcs 
of great circles; that is, a spherical polygonal version of the JCT. In the proof of the 
Kepler conjecture, the combinatorial structure of a cluster of spheres is encoded as 
a hypermap. An Euler characteristic calculation, based on the JCT, shows that these 
hypermaps are planar. The planarity of these hypermaps is a crucial property that 
is used in the enumeration of tame graphs (Sect. 6). The background chapter in the 
blueprint edition contains detailed proofs of these facts. 

The blueprint edition contains several introductory essays that introduce the main 
concepts in the proof, including the algorithms implemented by the computer code. 

The formalization of the blueprint text started in 2008 with work of J. Rute (CMU) 
and the Hanoi Flyspeck group. The current members of this group are Tran Nam 
Trung, Nguyén Tat Thang, Hoang Lé Truong, Nguyén Quang Truong, Vii Khiac Ky, 
Nguyén Anh Tam, Nguyén Tuyén Hoang, Nguyén Dic Phuong, Vuong Anh Quyén, 
Phan Hoang Chon, and managed by Ta Thi Hoai An. The blueprint formalization is 
still at an early stage. 
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3 Formalizing the Ordinary Mathematics 


This section describes some of the work on formalizing Euclidean space and measure 
theory, and the development of further proof automation, which should be useful in 
this endeavor. 

The computer code in Flyspeck has so far received the lion’s share of the formal 
effort. This is entirely reasonable since there are, or at least were, real questions about 
the feasibility of reproducing these results in a formal way. However, the Flyspeck 
proof includes a large amount of “ordinary” mathematics, which also needs to be 
formalized. Here we are on fairly safe ground in principle, because by now we un- 
derstand the formalization of mainstream mathematics quite well [48]. It is safe to 
predict that this formalization can be done, and we can even hope for a reasonably 
accurate estimate of the effort involved. Nevertheless, the formalization is certainly 
nontrivial and will require considerable work. 


3.1 Formalizing Euclidean Space 


Much of our work has been devoted to developing a solid general theory of Euclidean 
space RY [26]. For Flyspeck, we invariably just need the special case R*. While some 
concepts, e.g., vector cross products, are specific to R*, most of the theory has been 
developed for general RY so as to be more widely applicable. The theorem prover 
HOL Light [25] is based on a logic without dependent types, but we can still encode 
the index N as a type (roughly, an arbitrary indexing type of size N). This means that 
theorems about specific sizes like 3 really are just type instantiations of theorems for 
general N stated with polymorphic type variables. The theory contains the following: 


e Basic properties of vectors in RY, linear operators and matrices, dimensions of 
vector subspaces, and other bits of linear algebra. For example, the following is a 
formal statement of the theorem that a square matrix A’ is a left inverse to another 
one A iff it is a right inverse. Note that the double use of the same type variable N 
constrains the theorem to square matrices: 


|- VA:real*N*N A’:real*N‘N. 
(A ** A’ = mat 1) —& (A’ ** A = mat 1) 


e Metric and topological notions like distances, open sets, closure, compactness, and 
paths. Some of these are very general, others are more specific to Euclidean space. 
Some results include the Heine—Borel theorem, the Banach fixed-point theorem, 
and Brouwer’s fixed-point theorem. The following is a formal statement that con- 
tinuous functions preserve connectedness. 


|- Vf:real*M->real*N s. £ continuous_on s A. connected s 
=> connected(IMAGE f s) 


e Properties of convex sets, convex hulls, cones, etc. Results include Helly’s theo- 
rem, Carathéodory’s theorem, and various classic results about separating and sup- 
porting hyperplanes. The following states a simpler but not entirely trivial result 
that convex hulls preserve compactness. 


|- Vs:real*N->bool. compact s > compact (convex hull s) 
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e Sequences and series of vectors and uniform convergence, Fréchet derivatives and 
their properties, up to various forms of the inverse function theorem, as well as spe- 
cific one-dimensional theorems like Rolle’s theorem and the Mean Value Theorem. 
Here is the formal statement of the chain rule for Fréchet derivatives. 


|- V£:real*M->real*N g:real*N->real*P f’ g’. 


(£ has_derivative £’) (at x) A 
(g has_derivative g’) (at (f x)) 
=> ((g o £) has_derivative (g’ o £’)) (at x) 


3.2 Formalizing Measure Theory 


Although the basic Euclidean theory is an important foundation, and many of the con- 
cepts like “convex hull” are used extensively in the Flyspeck mathematics, perhaps 
the most important thing to formalize is the concept of volume. 

We define integrals of general vector-valued functions over subsets of RY, using 
the Kurzweil—Henstock gauge integral definition. We develop all the usual properties 
such as additivity and the key monotone and dominated convergence theorems. We 
also develop a theory of absolutely integrable functions, where both f and | f| are 
gauge integrable; this is known to coincide with the Lebesgue integral. Here is a 
formal statement of the simple theorem that integration preserves linear scaling: 


|- Vi:real*M->real*N y s h:real*N->real*P. 
(£ has_integral y) s A linear h 
=> ((h o £) has_integral h(y)) s 


Using this integral applied to characteristic functions, we develop a theory of 
(Lebesgue) measure, which of course gives volume in the three-dimensional case. 
The specific notion “measure zero” is formalized as negligible, and we also have 
a general notion of a set having a finite measure, and a function measure to return 
that measure when it exists. For example, this is the basic additivity theorem: 


|- Vs t. measurable s A measurable t A DISJOINT s t 
=> measure(s UNION t) = measure s + measure t 


We have proved that various “well-behaved” sets such as bounded convex ones 
and compact ones, or more generally those with negligible frontier (boundary) are 
measurable, e.g., 


|- Vs:real*N->bool. 
bounded s A negligible(frontier s) => measurable s 


The main lack at the moment is a set of results for actually computing the measures 
of specific sets, as needed for Flyspeck. We can evaluate most basic one-dimensional 
integrals by appealing to the Fundamental Theorem of Calculus, but we need to en- 
hance the theory of integration with stronger Fubini-type results so that we can evalu- 
ate multiple integrals by iterated one-dimensional integrals. This work is in progress 
at the time of writing. 
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3.3. Enhanced Automation 


Using coordinates, many nontrivial geometric statements in R°, or other Euclidean 
spaces of specific finite dimension, can be reduced purely to the elementary theory of 
reals. This is known to be decidable using quantifier elimination [3, 29, 45]. However, 
in practice, this is often problematic because quantifier elimination for nonlinear for- 
mulas is inefficient. The problem is particularly severe if we want to have any kind 
of formal proof, as we do in Flyspeck, since producing such a proof induces further 
slowdowns [31, 34]. With this in mind, we have explored a different approach to the 
case of purely universally quantified formulas [27], based on ideas of Parrilo [41]. 
This involves reducing the initial problem to semidefinite programming, solving the 
SDP problem using an external tool and reconstructing a “sum-of-squares” (SOS) 
certificate that can easily be formally checked. 

For example, suppose we wish to verify that if a quadratic equation ax? ++bx +c = 
0 has a real root, then b? > 4ac. Using the SDP solver, we find an algebraic certificate 
b* — 4ac = (2ax + b)* — 4a(ax* + bx +c), from which the required fact follows 
easily: (2ax + b)* > 0 because it is a square, and 4a (ax? + bx +c) =0 because x is 
aroot, and so we deduce b* — 4ac > 0. This method seems very useful for automating 
routine nonlinear reasoning in a way that is easy and quick to formally verify, so that 
we do not have to rely on the correctness of a complicated program. It is even capable 
of solving the coordinate forms of some of the simpler Flyspeck inequalities directly, 
though it seems unlikely to be competitive with customized nonlinear optimization 
methods as described in Sect. 5. For example, one simple Flyspeck inequality is the 
following, which after being reduced to a real problem with nine variables (three 
coordinates for each point) is solved by SOS in a second: 


lle — vl] = 2A lu — wll =2 A |v—wll >2 A lu — vl < v8 


=> |lw— G+ v)/2|| > |lu — vf]/2. 


A quite different approach to geometric theorem proving is to work in the setting 
of a general real vector space or normed real vector space. In this case, other deci- 
sion methods are available [44]. In particular, one of these decision procedures that 
we have implemented in HOL Light can sometimes handle simple forms of spatial 
reasoning in a purely “linear” way, and so be much more efficient than the direct re- 
duction to coordinates, even if we do in fact have a specific dimension in mind. One 
real example from formalizing complex analysis is the following in R?: 


I|lw—zl|-rl=dA |u—wi<d/2allx—zl=r = d/2<|x—ul. 


4 Standard ML Reimplementation of Code 


This section describes a reimplementation of the computer code used in the proof of 
the Kepler conjecture. The code has been substantially redesigned to avoid various 
difficulties with the original implementation. 
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4.1 Code 


The original proof of the Kepler conjecture relies significantly on computation. Com- 
puter code is used extensively and is central both to the correctness of the result and 
to a thorough understanding of the proof. 

There are four major difficulties with understanding and verifying the original 
code base. The first and most glaring difficulty is simply the amount of code. At the 
website [11] that posts the code for the original proof, there are well over 50,000 lines 
of programs in Java, C++, and Mathematica (among others). This represents only the 
calculations that Hales did himself. Samuel Ferguson also completed many of the 
calculations with an entirely different code base! of 137,000 lines of C. By contrast, 
the proof of the four-color theorem by Robinson et al. [43] is less than 3,000 lines 
of C. 

The second difficulty is in the organization of the code. The calculations were 
done over the space of four years and involved thousands of executions of a multi- 
tude of independent programs. Section 1.2 identifies three main computational tasks: 
tame graph generation, linear program bounding, and nonlinear inequality verifica- 
tion. Each of these main tasks consists of several subtasks. For example, verifying the 
inequalities required dozens of relatively complicated preprocessing phases where 
second derivatives of the relevant functions were bounded over fixed domains. As 
another example, many linear programs were solved only after a branch and bound 
period which were recorded in voluminous log files. In an attempt to organize the 
complex web of calculations, Hales devised a labeling scheme to uniquely identify 
the calculations. However, even now some computations relied upon by the proof 
are difficult to find in the original source code. To locate, for instance, computation 
“CALC-821707685” from the original proof [23, p. 159], one can search on the web- 
site [11] in vain. While records of the computations were made, it is not always an 
easy matter to find them without guidance from the authors. 

The third difficulty lies in the complexity of the implementation. For instance, the 
software developed to prove the inequalities upon which the original proof rests is 
relatively complicated. Processing power at the time (1994-1998) was just barely 
capable of completing the computations requested. To keep the length of execution 
to days or weeks instead of months or years, the code is extensively optimized. The 
optimizations were often implemented without comment in the source and in some 
cases are difficult to understand. 

The fourth difficulty is that the original code uses C and C++ to carry out interval 
arithmetic calculations based on floating point arithmetic. In the process, it explicitly 
sets the IEEE 754 [30] rounding modes on the processor’s floating point unit. While 
floating point is desirable for its speed, there are difficulties with using floating point 
for software that requires a very high level of rigor such as that supporting mathe- 
matical proof. The first is that reasoning about floating point instructions requires a 
relatively deep understanding of the machine architecture [36]. For instance, setting 
the rounding mode changes the state of the processor itself. Such an instruction has a 


'There is a large amount of code copying in Ferguson’s code, resulting in a much larger code base. The 
number of distinct lines is difficult to measure. 
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global effect on all subsequent floating point computations. In the original code base, 
the rounding modes are explicitly changed at least 400 times. Moreover, compilers, 
libraries, and even processors are notorious for unsound implementations of the 754 
standard. 


4.2 Reimplementation 


In 2004, we decided to reimplement the original code base. We decided that the 
new implementation should not require floating point numbers and rounding modes. 
Though speed was important, we wanted the code to be independent of any particu- 
lar interval arithmetic implementation. This meant we could use a fast floating point 
implementation of interval arithmetic for our daily work, but could use a slower but 
more trustworthy implementation such as MPFI [42] to double check important com- 
putations. We also wanted to organize the new implementation such that any of the 
many computations upon which the proof relies could be evaluated from a single in- 
terface. This would allow Flyspeck developers to find and easily check the text of the 
proof during the formalization process. Finally, we wished to bring the computational 
aspects of the Kepler conjecture closer to the level of simplicity and clarity necessary 
for formalization by a proof assistant. We began this work in the spirit of Robinson 
et al. [43], which simplified the original code of Appel and Haken [1] and was used 
by Gonthier to construct the fully formal proof [8] in the Coq proof assistant. 

We chose Standard ML for the reimplementation for a number of reasons. It has 
a formal definition [35], and thus programs have a meaning apart from the partic- 
ular compiler used. It has an efficient compiler, named MLton [46]. (Our reimple- 
mentation runs between 50% and 200% the speed of the original implementation 
compiled with GCC.) MLton has the ability to use external libraries written in lan- 
guages other than SML with relative ease. This allowed us, from one programming 
environment, to control multiple linear programming solvers, interval arithmetic im- 
plementations, and nonlinear optimization packages. SML has an expressive module 
system, and thus it was simple to write our code with respect to an abstract type of 
interval arithmetic. Thus we could use multiple independent implementations with 
ease. As of 2008, most of the code has been completely rewritten in SML and is 
executable from a single command-line program. The code is freely available at the 
project website [32]. 

In the original implementation, the myriad computations were done with many 
different programs written in a half dozen programming languages. The results of 
these computations are not always easy to find or interpret. Now all the computations 
are executed from the same source, with organized output. In addition to giving us 
added confidence that the original computations were sound, we have a fairly com- 
plete suite of software support for the Flyspeck project. We are now in the process 
of organizing and reevaluating the thousands of computations upon which the proof 
depends. 


5 Proving Nonlinear Inequalities with Bernstein Bases 


The hardest computational part of the original proof of the Kepler conjecture is the 
verification of a list of about a thousand nonlinear inequalities. This section presents 
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a technique aimed at proving them, based on polynomial approximation and Bern- 
stein bases. We feel that this approach better fits the requirements of formal proof, as 
outlined in Sect. 4.2. We hope to refine the method to cover all Flyspeck inequalities. 

We exhibit the method on a single inequality CALC-586468779. The original 
proof contains the following definitions [10]: 


pss chai v2 
i= arctan —, 
e 3 5 


v2 


2, 
gz — 4arctan = 


pp: , 
Cl £ ft 4 
1 98 ag ye 


Soct = 


1 2 2 2 
A(y):= 5]! y3 O yy YS 


ly yy 0 


1 yp ye y2 0 


= 1) 9 ——e. 2 2 
ao(y) = yi y2y3 + 5 (192 + YY. + YP Y3 + Y23 + Y1Y3 


+ y2y3 — 14 — Y2YS — Y3¥6), 
ai(y) = ao(y1, Ys, Yo, V4, V2, V3), 
a2(y) = ao(y2, Y4, Yo, YS, Vis Y3), 
a3(y) = ao(y4, 5, Y3, V1, Y2 Yo), 


3 
Soct 2< VACy) 
= —-— /A(y)+ = arctan : 
y(y) 6 (y) 3 dX re 
The statement of the inequality is: 
Vy €[2,2.51]© y(y) <pt. (1) 


Define the difference of two intervals by [a1, b1] — [a2, b2] = [a1 — b2, bj — az]. In- 
terval arithmetic is used to prove the inequalities in the original proof. It suffers from 
the dependency problem: the minimum and maximum of the formula x — x are over- 
estimated because [a, b] — [a, b] = [a — b, b — a], although x — x is clearly 0. Subdi- 
viding [a, b] into [a, a4P] and [se b], and then re-evaluating the formula yields an 
improved result. However, depending on the problem, the number of required subdi- 
visions can be very large. This is why checking some inequalities takes a very long 


time. 
5.1 From Geometrical Functions to Polynomials 


Fortunately, better methods than interval arithmetic are available if the function under 
consideration is polynomial. A quick look at y tells us that (1) is not polynomial, 
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since it has occurrences of ./-, 1/-, and arctan. Can it nevertheless be reduced to a 
polynomial problem? Two strategies come to mind: 

First, algebraic laws such as fa <bSa< b? af b > 0) and 3 <csa<be 
(if b > 0) can often be used to eliminate occurrences of ./- and 1/-. The list of 
trigonometric identities is endless. For our example, Vega’s rule arctana + arctan b = 
arctan nae seems useful. Unfortunately, this technique quite often yields huge ex- 
pressions that are difficult to deal with by virtue of their sheer size. Also, an algebraic 
transformation to a polynomial problem may simply be impossible (we suspect that 
this is the case for (1)). 

A second technique is based on replacing y with a polynomial g that dominates it 
but is still smaller than pt. Clearly, if there exists a g such that 


Vy €[2,2.511° y(y) <8) (2) 
and 
Vy €[2,2.51]© g(y) <pt, (3) 


then by transitivity (1) holds. 

Such a polynomial g can be obtained by replacing ./-, 1/-, and arctan with poly- 
nomial approximations. We only need to ensure that we use upper approximations for 
positive occurrences and lower approximations for negative ones. Only occurrences 
whose arguments contain variables need to be replaced, since, e.g., 2 is a (constant) 
polynomial itself. 

In the definition of y, the function arctan occurs positively, so it is replaced by an 


upper approximation arctan. The term vat is first unfolded to ./AQ) - a Both 
the square root and reciprocal occur positively again here, so they can be replaced 
by upper approximations sqrt and rep, respectively. This yields arctan(sqrt(A(y)) - 
rcep(a;(y))) in all four summands. There remains only ./- occurring negatively after 
— Bost , which is to be replaced by a lower approximation sqrt. 


We choose the following approximations: 


—_— J2 25 J2 
arctan(t) := arctan + t ‘ 
2) 27 a 
rate) 1 37t e 1303 : a 
rc — 
4 4 1600 1000 640000 6400000’ 


3 
64(2 — 4arctan 2) 


sqrt(t) = 8V2+ (¢— 128), 


1 
= 84/2. (p= 198), 
sqrt(t) + ea" ) 


These approximations are valid on the domain (1). For example, 


Vt € A([2,2.51]°) Vf <sqrt(t). 
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This can be established by elementary means, knowing that A((2,2.51]*) < 
[128; 501]. The latter can be shown automatically by the method outlined in the 
next subsection. 

In summary, we arrive at the following definition of g: 


3 


Soc 2 —___ __ 
e(y) i= ——esart(A(y)) + 5 > arctan(sqrt(A(y)) - Fp(a;(y))). 
i=0 


In form, it is almost identical to the definition of y. Our construction of g therefore 
ensures (2). Moreover, the approximations were chosen (using polynomial interpola- 
tion) in a way such that 


y (2, 2,2, 2, 2,2) = g(2, 2, 2, 2, 2,2) = pt. (4) 
This is important, because otherwise (3) cannot hold. 
5.2 Bounding Polynomials 


In order to prove (1), it remains to be shown that g(y) < pt. This can be done with 
the help of Bernstein polynomials. We briefly outline the case of a single variable x 
here. 

The ith Bernstein basis polynomial of order k is defined as 


BE (x) := (Js =x), 
i 


For a polynomial p and a vector b € R*, if 


k 
p(x) = 2b BY), 


i=0 
then b is called the Bernstein representation of p. In this case, 


Vx €[0;1] p(x) < maxbj. 
L 


This property is tremendously useful: it gives us an upper bound on p, namely the 
largest coefficient of p’s Bernstein representation. By a change of variable we can 
reduce any interval to [0, 1]. The generalization to the multivariate case is straight- 
forward [7, 49]. 

In order to bound a polynomial, it thus suffices to convert it into Bernstein repre- 
sentation. This can be done by a matrix multiplication (the Bernstein basis of order k 
forms a basis of the vector space of all polynomials of degree up to k). For practical 
purposes, it is however crucial to use a more efficient algorithm (cf. [7, 49]). 

Note that g contains irrational coefficients. This is a consequence of require- 
ment (4) and cannot be avoided. However, we were able to choose the approximation 
polynomials in a way such that the transcendental parts can be factored out (hence 
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J2 


the occurrence of 2 — 4 arctan *= in the definition of sqrt). As can be easily checked 


with symbolic algebra software, the polynomial p(y) := /2(g(y) — pt) has rational 
coefficients! It can thus be converted to a Bernstein representation without rounding, 
using the algorithm presented in [49]. With this method the only divisions are by 
powers of 2, which can be efficiently represented using dyadic numbers. 

The polynomial p consists of 12945 monomials and has total degree 18. A pro- 
totype implementation in Haskell returns 0 as the maximum for p in about half a 
minute. Thus /2(g(y) — pt) < Oand g(y) < pt. 


6 Tame Graph Enumeration 


Tame graphs are particular plane graphs that represent potential counterexamples to 
the Kepler conjecture. The Archive is a list of over 5000 plane graphs. The original 
proof generates the Archive with the help of a Java program that enumerates all plane 
graphs. Tameness is defined in Sect. 18, and the enumeration is sketched in Sect. 19 
of [23]. This section sketches the formally machine-checked proof of Claim 3.13 and 
Theorem 19.1 in the original proof [23]: 


Theorem 1 Any tame plane graph is isomorphic to a graph in the Archive. 


There are two potential reasons why an error in the original proof of this theorem 
might have gone undetected: the publications only sketch the details of the enumer- 
ation, and the referees only made a passing glance at the implementation, consisting 
of more than 2000 lines of Java. 

We recast the Java program for the enumeration of all tame graphs in logic, proved 
its completeness with the help of an interactive theorem prover, ran it, and compared 
the output to the Archive. It turns out that the original proof was right, the Archive 
is complete, although redundant (there are at most 2771 tame graphs). Doing all this 
inside a logic and a theorem prover requires two things: 


e The logic must contain a programming language. We used Church’s higher-order 
logic (HOL) based on (-calculus, the foundation of functional programming. Pro- 
grams in HOL are simply sets of recursion equations, i.e., pure logic. 

e The programming language contained in the logic must be efficiently executable, 
and such executions must count as proofs. The theorem prover that we used, Is- 
abelle/HOL [37], fulfills this criterion. If all functions that appear in a term are e1- 
ther data, e.g., numbers, or functions defined by recursion equations, Isabelle/HOL 
offers the possibility to evaluate this term ¢ in one big and relatively efficient step 
to a value v, giving rise to the theorem ¢t = v. 


The enumeration of all tame graphs generates 23 million plane graphs—hence the 
need to perform massive computations in reasonable time. 

Now we give a top-level overview of the formalization and proof of complete- 
ness of the enumeration of tame graphs in HOL. For details, see [38]. The complete 
machine-checked proof, over 17000 lines, is available online in the Archive of Formal 
Proofs at afp.sf.net [2]. 
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6.1 Plane Graphs 


Following the original proof, we represent finite, undirected, plane graphs as lists 
(= finite sets) of faces and faces as lists of vertices. Note that by representing faces as 
lists they have an orientation. The enumeration of plane graphs requires an additional 
distinction between final and nonfinal faces. Hence a face is really a pair of a list of 
vertices and a Boolean. A plane graph is final iff each of its faces is. In final graphs, 
we can ignore the Boolean component of the faces. 


6.2 Enumeration of Plane Graphs 


The original proof characterizes plane graphs by an executable enumeration and 
sketches a proof of completeness of this enumeration. We have followed the origi- 
nal proof and taken this enumeration as the definition of planarity. The enumeration 
of plane graphs in the original proof proceeds inductively: you start with a seed graph 
with two faces, the final outer one and the (reverse) nonfinal inner one. If a graph con- 
tains a nonfinal face, it can be subdivided into a final face and any number of nonfinal 
ones. Because a face can be subdivided in many ways, this process defines a tree of 
graphs. By construction the leaves must be final graphs, and they are the plane graphs 
we are interested in: any plane graph of n faces can be generated in n — | steps by 
this process, adding one (final) face at a time. For details, see [23] or [38]. 

The enumeration is parameterized by a natural number p which controls the max- 
imal size of final faces in the generated graphs. The seed graph Seed, contains two 
(p + 3)-gons, and the final face created in each step may at most be a (p + 3)-gon. 
As a result, different parameters lead to disjoint sets of graphs. 

The HOL formalization defines an executable function next-plane, that maps a 
graph to a list of graphs, the successor graphs reachable by subdividing one nonfinal 
face. The plane graphs are the final graphs reachable from Seed, via next-plane p for 
some p. 


6.3 Enumeration of Tame Graphs 


The definition of tameness in the original proof is already quite close to a direct logi- 
cal formulation. Hence the HOL formalization is very close to this. Of course pictures 
of graphs had to be translated into formulae, taking implicit symmetries in pictures 
into account. We found one simplification: in the definition of an admissible weight 
assignment, one can drop condition 3 (a condition on the 4-circuits in graphs) without 
changing the set of tame graphs. What facilitated our work considerably was that a 
number of the eight tameness conditions in the original proof are directly executable. 
The details are described elsewhere [38]. 

The enumeration of tame graphs is a modified enumeration of plane graphs where 
we remove final graphs that are definitely not tame, and prune the search tree at non- 
final graphs that cannot lead to tame graphs anymore. The published description [23] 
is deliberately sketchy, and the precise formulation of the pruning criteria is based 
on the original Java programs. This is the most delicate part of the proof because we 
need to balance effectiveness of pruning with simplicity of the completeness proof: 
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weak pruning criteria are easy to justify but lead to unacceptable run times of the 
enumeration, sophisticated pruning techniques are difficult to justify formally. Since 
computer-assisted proofs are still very laborious, simplifying those proofs was of 
prime importance. In the end, the HOL formalization defines a function next-tame p 
from a graph to a graph list. It computes the list of plane successor graphs next-plane p 
g and post-processes it as follows: 


(1) Remove all graphs from the list that cannot lead to tame graphs because of lower 
bound estimates for the total admissible weight of the final graph. 

(2) Finalize all triangles in all of the graphs in the list (because every 3-cycle in a 
tame graph must be a face). 

(3) Remove final graphs that are not tame from the list. 


A necessary but possibly not sufficient check for tameness is used in the last step. 
Hence the enumeration may actually produce nontame graphs. This is unproblematic: 
in the worst case, a fake counterexample to the Kepler conjecture is produced, but we 
do not miss any real ones. 

Although we have roughly followed the procedure of the original proof, we have 
simplified it in many places. In particular, we removed the special treatment of Seedo 
and Seed), which is a fairly intricate optimization that turned out to be unnecessary. 

The following completeness theorem is the key result: 


Theorem 2 /f a tame and final graph g is reachable from Seedy via next-plane p, 
then g is also reachable from Seed y via next-tame p. 


Each step next-tame, is executable, and an exhaustive enumeration of all graphs 
reachable from a seed graph is easily defined on top of it. We call this function 
tameEnum,. By definition, tame graphs may contain only triangles up to octagons, 
which corresponds to the parameters p = 0,...,5. 


6.4 Archive 


In order to build on the above enumeration of all tame graphs without having to 
rerun the enumeration, the results of running tameEnumy with p =0,...,5 are put 
into an Archive, and isomorphic graphs are eliminated. This results in 2771 graphs, 
as opposed to 5128 in the original proof. The reasons are twofold: there are many 
isomorphic copies of graphs in the Archive, and it contains a number of nontame 
graphs, partly because, for efficiency reasons, the original proof did not enforce all 
tameness conditions in its Java program. The new reduced Archive is also available 
online [2]. 

Finally we can prove Theorem 1: if g is tame plane graph, Theorem 2 and the 
definition of tameEnum tell us that g must be contained in tameEnum, for some 
p =0,...,5. Hence it suffices to enumerate tameEnum,, p =0,...,5, and check 
that, modulo graph isomorphism, the result is the same as the Archive. This is a 
proposition that can be proved by executing it (because the HOL formalization also 
includes a verified executable test for graph isomorphism which we do not discuss). 
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7 Verifying Linear Programs 


This section reports on the current state of the formal verification of the linear pro- 
gramming part of the proof of the Kepler conjecture. The results of the linear pro- 
gramming in the original proof are recorded in several gigabytes of log files. This 
section presents the formalization of the generation and bounding of these linear pro- 
grams in the mechanical proof assistant Isabelle [37]. A more detailed version of the 
material presented here can be found in S. Obua’s thesis [39]. 

This formalization relies on the archive of tame graphs. Each tame graph (except 
the graphs associated with the face-centered cubic and hexagonal close packings) 
represents a potential counterexample to the Kepler conjecture. Each potential coun- 
terexample obeys certain constraints. The original proof refutes each potential coun- 
terexample by building a linear program from the constraints and showing that these 
linear programs are infeasible. 

A structure, which we call a graph system, makes this precise in a formal envi- 
ronment. An instance of a graph system is a tame graph which obeys the constraints 
listed in the definition of a graph system. The current formalization does not use all 
of the constraints of the original proof but only those that do not require branch and 
bound strategies. Our current notion of graph system can be viewed as a detailed 
formalization of what is called basic linear programs in [23, Sect. 23.3]. Because 
the current formalization does not capture not all constraints of the original proof, 
we cannot hope to refute all potential counterexamples. Nevertheless we manage to 
refute most of them. 

We represent tame graphs as hypermaps [8, 20]. As mentioned in Sect. 2, a hyper- 
map is just a finite set D of darts together with three permutations on D: the edge, 
the face, and the node permutation that compose to the identity eono f = J. See 
Fig. 1. This representation greatly simplifies the axiomatization of a graph system. 
For a detailed description of how we represent hypermaps and for a complete list of 
all of the axioms of a graph system, see [39, Sect. 4]. 

Figure 2 summarizes how we generate and solve the linear programs. We apply the 
axioms of a graph system to each tame graph. This results in a large Isabelle theorem 
which is a conjunction of linear equalities and inequalities. We then normalize this 


ats 
\,7 
TAY 4g 
a 


Fig. 1 A hypermap is a combinatorial structure attached to a planar graph. A dart, represented as a small 
triangle, is place at each face angle of the graph. In the second frame, the face permutation f cycles 
through the darts in each face. In the third frame, the node permutation n cycles through the darts at each 
node. The edge permutation e (not shown) is an involution exchanging darts from opposite ends of the 
edge of a graph. (This figure has been reproduced from [24].) 
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Fig. 2 Refuting a potential 


counterexample to the Kepler Graph System 
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conjunction to bring it into the form of a matrix inequality 


Ax <b. (5) 


The entries of A and b are symbolic expressions which contain various real constants 
(such as zr) that are needed to define the axioms of a graph system. In order to apply 
linear programming, we need to replace A and b with numerical approximations. 
We achieve this via a formalization of interval arithmetic in Isabelle and arrive at 
numerical matrices A’, A”, and b’ for which we have the formally proven Isabelle 
theorems 


A’<A< A", b<D’. (6) 


We then apply a simple preprocessing step which gives us formally proven a priori 
bounds x’ and x” for x such that 


xi <x<x", (7) 


It is then possible to obtain a certificate from an external linear programming solver 
like GLPK (Gnu Linear Programming Kit) that allows us to formally reach a contra- 
diction from Ax < b. The beauty of a certificate is that we can use results obtained 
from an untrusted source (in our case, this untrusted source is a heavily optimized 
linear program solver programmed in C) in a trusted and completely mechanically 
verifiable way. 

In this way, we proved the inconsistency of 2565 of the graph systems but failed 
to prove the inconsistency of the remaining 206. This yields a success rate of about 
92.5%. Future work will extend the notion of graph system and generate linear pro- 
grams that take all the constraints of the original proof into account. 
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Part 2. Addendum to and Errata in the Original Proof 
8 Biconnected Graphs 


This section gives further detail to the argument of [15, Sect. 12.7, p. 131]. There it 
is claimed that the proof of the main estimate [15, Theorem 12.1] can be reduced to 
the case of polygonal standard regions. This claim is correct. However, the justifica- 
tion of this claim is not complete in the original proof. This section gives complete 
justification of the claim. The main result is Theorem 3 stated below. 

This section patches the original proof. As such, it should be imagined this section 
to be inserted as an addendum, directly following Sect. 12.13 in the original proof, 
as a new Sect. 12.14. (In fact, Sect. 12.13 is itself an addendum to the 1998 preprint, 
making this section the second addendum to Sect. 12.) If the actual insertion were 
to be carried out, the corresponding changes to the numbering of the results in this 
section would be made: Theorem 3 below would be inserted as a new Theorem 12.21, 
and so forth. 

In the original proof, the boundaries of standard regions may fail to be simple 
polygons. In fact, the failure of the boundaries to be polygons may be quite severe: 
the boundaries may contain multiple components and articulation vertices. An articu- 
lation vertex in a graph is a vertex whose removal increases the number of connected 
components of the graph. A connected graph is biconnected if it contains no articu- 
lation vertex. 

In the original proof, each standard region is first prepared by a process of erasure 
(described in [15, Sect. 11.1]), then deformed in a way to transform its boundary into 
a simple polygon. The deformation is made in such a way that the values of two key 
functions voro,r and To,r are left unchanged. These are the functions that enter into 
the main estimate [15, Theorem 12.1]. 

What the original proof fails to consider is a particular hypothetical situation where 
the deformation might fail. This situation is illustrated in Fig. 3, where the rigid move- 
ment of set of vertices (such as the illustrated triangle with vertex w) is blocked by a 
nearby vertex v that is not visible from w, when the distance between v and w drops 
to the minimum value 2. This section presents a proof that this hypothetical situation 
does not occur. 

The strategy of the proof in this section is to use a deformation argument. A gen- 
eral decomposition star is deformed until the graph attached to it becomes bicon- 
nected. In a biconnected planar graph with at least three vertices, each face is a sim- 
ple polygon. Thus, by producing a biconnected graph, we achieve our objective. In 
the original proof of the Kepler conjecture, biconnected graphs are not mentioned by 
name. Nevertheless, most of the graphs that occur in the late stages of the original 
proof are biconnected. 

In more detail, the proof in this section will produce a sequence of admissible 
deformations of a decomposition star D in such a way that the individual standard 
regions R are preserved in number and identity (but not in shape) by the deforma- 
tions. The deformations will preserve the values of voro,r(D) and to,r(D) for each 
standard region R. The deformations will change the combinatorial structure of the 
graph formed by all the subregions of the decomposition star. At the conclusion of 
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Fig. 3 In this figure (not to scale), segments represent geodesic arcs on the unit sphere. The shaded region 
is a subregion whose boundary is not a simple polygon. We wish to rigidly slide the triangle containing 
the vertex w until a new visible distinguished edge forms (say from w to u). We will show that this rigid 
motion does not decrease the distance between two vertices (say w and v) to the minimum distance 2 


the deformation, the decomposition star D will have a graph (formed by all the sub- 
regions of the decomposition star) that is biconnected. As in the original proof, the 
decomposition star is first simplified by a process of erasure, before the deformations 
begin. 

In the remainder of this section, we adopt notation, definitions, and conventions 
without further comment from [15]. 


8.1 Context 


We work in the following restrictive context for Theorem 3 and its proof. We fix a 
packing centered at a vertex at the origin. As described in Sect. 12.6 of the original 
proof, we assume that all upright quarters are erased, except loops (that is, those 
surrounded by anchored simplices). Let U be the set of (non-null) vertices of height 
at most 2.51. As usual, we say two edges {uw 1, w2} and {ui, us} cross if the interiors 
of the triangles formed by {0, u;, u2} and {0, w,, v5} intersect. 

We form the set of edges E’ between vertices in U, consisting of 


e All standard edges; that is, {v, w} C U such that 0 < ||v — w|| < 2.51. 

e All edges {v, w} C U of an anchored simplex, whenever the upright diagonal of 
the anchored simplex is an unerased loop. 

e All edges {v, w} C U such that 0 < ||v — w|| < /8, where {v, w} does not cross 
any other edge in previous two items. (If two of these edges cross, pick only one 
of them. This can only happen with conflicting diagonals of a quad cluster.) 


These edges do not cross. A special simplex {0, u, v, w} has one edge {v, w} of length 
at least \/8, called the special edge. A special edge has length at most 3.2. The other 
vertex u is called a special vertex (or special corner). Let E = E’ \ S, where S is the 
set of special edges (that is, the edges of special simplices shared with an anchored 
simplex). The projection of the line segments formed by EF to the unit sphere is a 
planar graph. The complement of this graph in the unit sphere is a disjoint union 
of connected components. The closures of these connected components are called 
subregions. 
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We call a loop subregion one that contains an unerased loop {0, v}. If R is a loop 
subregion, then there are no enclosed vertices of height <2.51 over the subregion. The 
corners of R are the anchors of the upright diagonal together with the special corners 
of the subregion. The subregion R is star convex with center point at the projection 
of v to the unit sphere. It follows that the boundary of R is a simple polygon. 

The graph J” with vertices U and edges E is not necessarily connected. The aim 
is to deform U (and D) to create a biconnected graph (without changing the values of 
voro,r(D) and to,r(D) for standard regions R; the values for individual subregions 
will change). Once the graph I” is biconnected, the subregions are simple polygons 
as desired. 

The deformation of U is admissible if it satisfies the following three conditions. 


If the graph I” is not connected, the deformation acts by a rotation about the origin 
on the vertices of a single chosen connected component of J”, leaving all other 
vertices of U fixed. (For example, in Fig. 3, the entire triangle with vertex w may 
be rotated.) In particular, ||v||, for v € U, is constant. 

If the graph I” is connected, but not biconnected, with a chosen articulation vertex 
a, then the deformation acts by a rotation about the axis {0, a} on the vertices of a 
single component of I” \ {a}, leaving all other vertices of U fixed. 

e The distance between v, w remains at least 2 for all v, we U. 


The deformation stops when either of the following halting conditions are met: 


(H1) ||v — w|| < V8, where the edge {v, w} ¢ E’, with {v, w} C U, does not cross 
any edge in E’, or 
(H2) ||v — w|| decreases to 2 for some v, w € U, and {v, w} crosses an edge in E’. 


We warn that the deformation is allowed to assume configurations in which the 
distance between some u € U and the upper endpoint of an unerased upright diagonal 
d is less than 2. This is not a problem because the vertex u does not end up as a 
corner of the loop subregion to which the upright diagonal d belongs. (In particular, 
the halting condition (H1) prevents a new special vertex from being added to a loop 
subregion.) Thus, when a single subregion is considered in isolation, all distances 
between pairs of vertices in that subregion are at least 2. 

In the restrictive context that has been described in this section, we have the fol- 
lowing result. 


Theorem 3 /f the graph I’ = (U, E) is not biconnected, then a nontrivial admissi- 
ble deformation of U exists. The halting condition (H2) never holds. The admissible 
deformation can always be continued until the halting condition (H1) holds. 


We break the proof of Theorem 3 into cases (a), (b), (c), (d) according to the num- 
ber and combinatorial structure of the edges {u;, u2} that pass through the triangle 
{0, v, w}. See Fig. 4. Lemma 4 below gives Case (a): we cannot have two such edges 
{u,,u2} and {u\, us} with distinct endpoints. It follows that there is an endpoint u2 
shared by every edge that passes through the triangle {0, v, w}. Lemma 9 below gives 
Case (b): there cannot be three or more edges. Finally, Cases (c) and (d) of one or 
two edges {uw ;,u2} are treated in further subsections. Each subsection is organized 
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= 


Fig. 4 Theorem 3 breaks into various cases: (a) two edges with distinct endpoints, (b) three or more 
edges, (c) two edges, or (d) a single edge passing through the triangle {0, v, w} 


around one of the main cases (with introductory Sect. 8.2): Cases (a), (b), (c), and (d) 
are treated in the four Sects. 8.3, 8.4, 8.5, 8.6, respectively. 

When U is deformed until the halting condition (H1) holds, a new edge {v, w} 
can be added to E. The theorem is repeated to add further edges, until a biconnected 
graph is obtained. The remainder of this section explains why the halting condition 
(H2) never holds. For this, we assume, on the contrary, that ||v — w|| = 2, where 
{u, w} belong to different connected components of I” (Gif I” is not connected) and 
different connected components of J” \ {a} (if a is an articulation vertex of a con- 
nected graph J”). When this situation occurs, we say that v, w belong to different 
bicomponents. Without loss of generality, we may also assume that {v, w} crosses 
some edge of E’. 


8.2 Preliminary Lemmas 


Lemma | below shows that a vertex does not cross over an edge during a deforma- 
tion; the halting conditions for the vertex are met before it reaches the edge. Recall 
that the term geometric considerations refers to a specific collection of methods in- 
troduced in [23, Sect. 4.2] to prove the existence and nonexistence of various simple 
configurations of points in R?. 


Lemma 1 Let S = {0, v, v1, u2} be a set of four distinct points in R? whose pair- 
wise distances are at least 2. Suppose that ||u|| < 2.51 for all u € S. Suppose that 
lu) — u2|| < 3.2. If the segment {0, v} meets the segment {u,, uz}, then ||v — uy||, 
|v — u2|| < 2.51. 


Proof This follows by geometric considerations. 


The next two lemmas give conditions under which the halting condition (H2) can- 
not hold. 


Lemma 2 Let S = {v, w,u1,u2, vo} be a set of five distinct points in R? whose 
pairwise distances are at least 2. Suppose that the segment {v, w} passes through 
{v0, Uj, U2}. Assume 


||; — ug < 3.2, 


llvo —ul| < 2.51 foruesS. 


Then ||\v — w|| > 2. 
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Fig. 5 In this figure (not to scale), segments represent geodesic arcs on the unit sphere. The two shaded 
regions represent subregions. There are possibly other subregions partitioning the unshaded portions of the 
figure. These other subregions are not represented in this figure. The shaded region marked L represents 
a loop subregion. The other shaded region represents a subregion with two boundary components that is 
undergoing a deformation, by the motion of the triangle with vertex v. The situation of Fig. 3 does not 
exist (as will be shown). The intervening loop subregion L forces v and w to be more than the minimum 
distance apart 


Proof This follows by geometric considerations. 


Lemma 3 Let S = {v, w,u1,u2, v0} be a set of five distinct points in R? whose 
pairwise distances are at least 2. Suppose that the segment {u,, u2} passes through 
{vo, v, w}. Assume 


|v; —u2|| < 2.91, 


luo —ul| < 2.51 forues. 


Then ||v — w|| > 2. 


Proof This follows by geometric considerations. 


By Lemmas 2 and 3, we may assume that for each edge {uw;,u2} that {v, w} 
crosses, we have that {u1, 2} passes through {0, v, w} and that ||w; — u2|| > 2.91. 
This means that the edge {u1, u2} is an edge of an anchored simplex, so that the edge 
is special or bounds a loop subregion. This loop subregion will provide the key to 
the proof, because we will show that it prevents the distance between v and w from 
becoming 2, as assumed (Fig. 5). 


Remark 1 As an aside, we mention that this issue of biconnected graphs is a major 
issue only in the proof of the Kepler conjecture and not in the proof of the dodecahe- 
dral conjecture [24]. In the proof of the dodecahedral conjecture, there are no loops 
of anchored simplices, and without loops there are no difficulties: edges of length at 
most v8 can be used instead of the set E’. Lemmas 2 and 3 suffice. 
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8.3 Case (a): Edge Crossings with Distinct Endpoints 


Lemma 4 There does not exist a set S = {0, v, w, uy, U2, UU} of seven distinct 
points in R? whose pairwise distances are at least 2 and that satisfies the following 
conditions. 


The edges {u,,u2} and {u’,,u'} do not cross. 

The edges {u,, uz} and {u',, v5} both pass through {0, v, w}. 

|u|] < 2.51 forallue S. 

|v — wl] =2. 

ey — wa, Nal, — we] $3.2. 

uy and u', lie in the same half-space bounded by the plane {0, v, w}. 

The directed segment from v to w crosses the segment {u',, u',} before the segment 
{u1, U2}. 


Note that the last two conditions can always be achieved by suitable labels on the 
points {w1,u2,u},u5}. 


Proof This will follow as a direct consequence of Lemmas 6 and 8 below. 


Lemma 5 Let S = {u, v, w} be a triangle such that each side has length at least 2, 
and such that \\u — v|| < 2.51, ||u — w|| < 2.51, ||v — w|| = 2. Let X be the set of 
points in the convex hull of S that have distance at least 1.2 from each vertex of S. 
Then the diameter of X is less than 1.044. 


Proof Let x = ||u — v|| and y = ||u — w||. Assume x > y. As u moves away from v, 
along a fixed line through v and an initial position ug, the region X expands. Thus, 
we may assume that x = 2.51. The boundary of X is a polygonal curve consisting of 
line segments and concave arcs of circles. The diameter is realized by the distance be- 
tween two vertices p;(y) of the polygonal curve. We consider two cases according to 
y <2.4 and y > 2.4 because the structure of the polygonal curve changes at y = 2.4. 
We calculate || p;(y) — p;(y)|| directly, checking for each (i, j) that the distances are 
less than 1.044. 


Lemma 6 Let S = {0, v, w, 1, U2, u} 5 us} CR? bea configuration of seven points 
that satisfies the conditions of Lemma 4. Then there exist a point p on the segment 
{u1, U2} and a point p' on the segment {u',, u} such that ||p — p'|| < 1.044. 


Proof Let u € {0, v, w}. By the metric constraints, the distance from u to the segment 
{u1, U2} (resp. {u', u5}) is at least 


4/22 — (3.2/2)? = 1.2. 


Let p (resp. p’) be the point of intersection of the segment {w1,u2} (resp. {u', u5}) 
with the convex hull of {0, v, w}. By Lemma 5, we have || p — p’|| < 1.044. 
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Lemma 7 Let S = {u1, us u2, us} C R3 bea set of four distinct points such that the 
distance between each pair of points is at least 2. Assume that 


||) — uo] < 3.2, [|e — v5 || < 3.2. 


If any of the following conditions hold: 


(A) [lui — v5], [lu — val] = 2.91 
(B) |lu1 — wi |, luz — w5|| = 2.85, |]u1 — u5|| = 2.91; or 
(C) llui — ul = 3.64, l]ui — w5|| = 2.91 


then every point on the segment {u,, U2} has distance greater than 1.044 from every 
point on the segment {u', u}}. 


Proof Assume for a contradiction that the assumptions hold and that the conclu- 
sion is false for some configuration. The metric constraints can be used to show 
that the segments {u;, v2} and {u', us} can be stretched along their axes without 
decreasing any edge length. Thus we may assume without loss of generality that 
\|u1 — u2|| = |v’, — u’|| = 3.2. Decreasing one dihedral angle of the simplex S at a 
time, we may move the segments closer together, until all four edges {u;, ui} attain 
their minimum length. Then we have three rigidly determined simplices (A), (B), (C) 
(with equality constraints). An explicit coordinate calculation of the distance between 
the two segments shows that the distance is greater than 1.044 in each case. 


Lemma 8 Let S = {0, v, w, u1, “2, u} F us} CR bea configuration of seven points 
that satisfies the conditions of Lemma 4. Then there do not exist a point p on the 
segment {u, U2} and a point p' on the segment {u',, u',} such that || p — p’|| < 1.044. 


Proof Assume for a contradiction that such p, p’ exist. In Lemma 7, we may assume 
that none of the conditions A, B, C hold. By obvious symmetry, without loss of 
generality, we may assume by case A that ||w2 — u'|| < 2.91. By Lemma 3, this 
implies that {u',, u2} does not pass through {0, v, w}. By the conditions of the lemma, 
the segment {u1, u'} does not meet the plane {0, v, w}, so that {u, u'} does not pass 
through {0, v, w}. This means that the triangle {w1, Ui uz} is linked around {0, v, w}, 
and some edge of {0, v, w} passes through {u1, w,, v2}. Up to symmetry there are two 
cases: (1) {0, v} passes through {u1, uy uz}, or (2) {v, w} passes through {u1, U, u2}. 

In the first case, recall that {u’, v5} does not cross {u1, u2}. So wu, is enclosed over 
(0, {u1,u,,u2}). Then {u,,u,} and {w4,u'} pass through {0, v, w}. By Lemma 3, 
we have ||u — w1||, lu, — u)|| = 2.91. As we are assuming that C does not hold, 
we have ||; — w/|| < 3.64. We claim that {0, u,} passes through {w1, u'}, v2}. Other- 
wise, uy lies in the convex hull of {0, uw, us uz} and a coordinate calculation shows 
that the upper bounds on the edges of the simplex {0, 1, u',, 2} are inconsistent 
with the lower bounds on the distances from uv‘, to the vertices of the simplex. Since 
{0, w,} passes through {w1,u‘,u2}, we may use geometric considerations to show 
that ||w’,|| > 2.51. This is contrary to the hypotheses of Lemma 4. 

In the second case, {v, w} passes through both {u1, uw, u2} and {u',, u,, u2}. Geo- 
metric considerations give ||“ — u‘|| > 2.85 and ||u2 — u4|| > 2.85. Assuming that 
B does not hold gives ||u; — u4|| < 2.91. The first paragraph of the proof now gives 
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that {u2, Us, uj} links around the triangle {0, v, w}. The edge {v, w} does not pass 
through {u2, u,v}. (This can be seen by drawing the relative positions of p(w) for 
u € S in the projection p of the points to a plane orthogonal to {v, w}.) Thus, we are 
in the first case, which has already been treated. 


8.4 Case (b): Triple Edge Crossings 


By Lemma 4, there is a common endpoint uz such that every edge of E’ that passes 
through {0, v, w} has w2 as an endpoint. Next we show that there cannot be three such 
edges. 


Lemma 9 There does not exist a set of seven distinct points 
/ My 
i {0, V,W, Uy, Uy, U), uz} 


in R? that satisfies the following conditions. 


e The distance between each pair of distinct points in S is at least 2. 
The edges {uj, uz}, {u,, U2}, and {u'), u2} pass through {0, v, w}. 
|u|| < 2.51 forallue S. 

|v — w|| = 2. 

lu — u2|| < 3.2 foru=u4, uu). 


Proof Assume for a contradiction that S exists. We may pivot w around the axis 
{0, v} until ||w — w2|| < 2.51. (The metric constraints on edge lengths show that the 
condition A > 0 is preserved for the simplices {u, u2, v, w} and {u, u2, w, 0} for u = 
uj,u), ui throughout this pivot.) A similar pivot of v gives ||v — u2|| < 2.51. We may 
order the vertices in cyclic order around {0, u2} as 


(w1, w2, w3, wa, Ws) = (w, uu), uy, v), 
so that setting d(w;, w;) = dih(0, u2, w;, w;), we have 
4 
d(w, ws) =) d(wi, wi+1) = d(w, w3) + d(w3, wa). 
i=1 


Interval calculations” give d(w2, w3), d(w3, w4) => 0.7 and d(w 1, ws) < 1.4. We ob- 
tain an immediate contradiction: 


1.4>d(wy, ws) > 0.7 +0.7. 


2CALC-2799256461, CALC-5470795818. 
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Fig. 6 The upright diagonal u 
cannot be placed over any of the 
regions A, B, C, D. The lines 
(not to scale) represent geodesic 
arcs on the sphere passing 
through the pairs of points in 
{p(u1), pu), p(v)}, where p 
denotes projection to the unit 
sphere 


8.5 Case (c): Double Edge Crossings 


This subsection treats the case of two edges crossings in the proof of Theorem 3. We 
continue to assume the general context of Theorem 3. As usual, the edge {u), v2} € E’ 
crosses {v, w}. 


Lemma 10 Let {u;,u2,w,v} be a set of four distinct points in R? (in the given 
context). Assume that ||v — u;\| < 2.51 for i = 1,2. Assume that no edge of E' crosses 
{v, w} in the open half-space A containing v bounded by the plane {0, uy, u2}. (That 
is, {u,, U2} is the first edge to cross {v, w}, moving from v toward w.) Assume that 
there is a loop subregion L along {u,, U2} on the A-side of {u,, u2}. Then {u1, u2} 
is a special edge of E' with corner v. In particular, {u,,u2} ¢ E, so that it is not a 
bounding edge of a subregion. 


Proof Assume for a contradiction that {uv , u2} is not special. Note that loop subre- 
gions have simple polygonal boundaries and remain rigid under all the deformations. 
In particular, the upright diagonal, special corners, and so forth remain rigidly posi- 
tioned with respect to the corners of the subregion. 

Since there are no further edges crossing {v, w}, the subregion L extends to v. 
Hence v is acorner of the subregion L. It is either an anchor or a special corner (with 
respect to L). However, it cannot be a special corner, by the assumption that {w, u2} 
is not a special edge. Hence it is an anchor. Also, wu; and wz are anchors. 

To reach a contradiction, we consider possible locations of the upright diago- 
nal {0,u} and show that it has nowhere to go (Fig. 6). Since {u1, v2} is an edge, 
the points uw1,u2 are consecutive anchors. This prevents u from lying over the re- 
gion B. Also, the upright diagonal of any unerased loop has at least four anchors 
(say U,U1,U2, w’). Moreover, some anchored simplex around the upright diagonal 
is not an upright quarter (because of the edge {uw}, u2}). Geometric considerations 
based on these constraints (say ||w’ — u1|| > 2.51) show that the fourth anchor w’ is 
not in A. This prevents u from being located over the region A. A vertex u; cannot 
be enclosed over an upright quarter {0, u, v, u;}. This excludes region C. Finally, an 
edge {v, u;} of length at most 2.51 cannot pass through a triangle {0, u, u;} of sides 
at most 2.51, 2.51, 8 (excluding D). 


Lemma 11 Let {u1,u2,w,v} be a set of four distinct points in R? (in the given 
context). Assume that ||v — u;|| < 2.51 fori = 1,2. Assume that no edge of E crosses 
{u, w} in the open half-space A containing v bounded by the plane {0, u,, u2}. (That 
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is, {u1,u2} is the first edge to cross {v,  w}, moving from v toward w.) Then both 
edges {u,, v} and {uz, v} belong to E. In particular, there is a circuit of the graph 
through v, uy, U2. 


Proof Vf there is a loop subregion L along {uj,u2} on the A-side of {u1, u2}, 
Lemma 10 implies that {w;,u2} is a special edge of E’ with corner v. In particu- 
lar, {u;, v} and {u2, v} are edges of E. The conclusion follows in this case. 

Now assume that there is no loop subregion along {u;,u2} on the A-side of 
{u1, U2}. 

Let S be the finite set of points of U enclosed over the simplex {0, u1, u2, v}. We 
show by contradiction that S is empty. The plane {0, v, w} separates S into a disjoint 
union S = S$; U S2, according to those in the same half-space as u;, i = 1, 2. We form 
the convex hull of the projection p to the unit sphere of the points S; U {u;, v}. As 
in [15, Sect. 12.13], form a sequence of geodesic arcs on the unit sphere from p(u;) 
to p(v). Let p(w;), for w; € S;, be the other endpoint of the arc starting at p(u;) (or 
set w; =v if S; =. For some w’ € {w, w2} \ {vu}, the edges {w’, u;} do not cross 
any edges of E. Furthermore, geometric considerations show that || w’ — u;|| < 2.51 
for i = 1,2. By the criteria for forming edges of E, we must have {w’, u;} € E for 
i = 1,2. This contradicts the assumption that {v, w} does not cross any edges of E 
over A. Hence S = 9. 

Since S = J, the edges {v, u;} do not cross any edges of E. By the criteria for 
forming edges of E, they belong to E. This completes the proof. 


We are ready to prove the next major case of Theorem 3. We continue to work in 
the general context of that theorem, with v and w in different bicomponents of the 
graph I". 


Lemma 12 In this context, no set of six points {0,v,w,u1, u',, U2} exists, where 
{u,,u2} and {u', u2} pass through {0, v, w} and such that 


||u1 — ual] < 3.2, |u, — ual] <3.2. 


Proof We argue by contradiction. We may assume that {w1,u‘} are ordered so that 
the cyclic order around {0, u2} is (w1, w2, w3, w4) = (Vv, ui} ,U,, w). By the previous 
results, there are at most two edges that pass through {0, v, w} in this manner. In 
particular, the part of the line segment {v, w} between the crossings of {uw1, u2} and 
{u',, U2} lies in a single subregion L. 

We claim that we do not have ||v — uv‘ ||, ||v — wall, |lw — will, lw — ual] < 2.51. 
Otherwise, we break into two cases and derive a contradiction as follows. Either (1) 
L is a loop subregion, or (2) L is not a loop subregion, but both regions adjacent to 
L (along edges {u1, u2}, {u', u2}) are loop subregions. In case (1), {w1, v2, ui} are 
corners of the loop subregion L. Hence, they lie on a circuit in J” formed by the 
corners of that loop subregion. If |v — u2||, ||v — v4 || < 2.51, then by Lemma 11, the 
points {v, uv}, v2} lie on a circuit in I’. A similar conclusion holds if corresponding 
inequalities hold || w — ual, ||w — uy || < 2.51. If all four inequalities hold, then these 
circuits put v, w in the same bicomponent of J”, which is contrary to hypothesis. 
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In case (2), then by Lemma 11, {v, uw}, {v, u\} € E, so {u', uz} is a special edge 
and L is a loop subregion. This is contrary to the assumption of case (2). Hence we 
may assume by symmetry and without loss of generality that ||v — u‘|| = 2.51 or 
|v — u2|| > 2.51. 

We may stretch along the edges {u2, ui}, {u2,u}, moving uj, u‘, until ||v2 — 
u4|| = ||u2 — u{ || = 3.2. We may add the inequality 


I[ual| < 2.23, 


for otherwise by geometric considerations ||u2 — u{|| > 3.2. Similarly, |u|] < 
2.23. If ||w2 — v|| => 2.51, we may pivot v toward w2 around the axis {0, w} until 
||u2 — v|| < 2.51. Similarly, we may assume that ||u2 — w|| < 2.51. Set d(i, j) = 
dih(0, uz, w;, w;). Then interval arithmetic calculations? give the contradiction: 


1.3>d(,4) =d(, 2) + d(2, 3) + d(3,4) > 0.5+0.8+0= 1.3. 
8.6 Case (d): Single Edge Crossings 


This subsection treats the proof of Theorem 3 in the case of a single edge crossing 
{u,, U2}. This is the final case of the proof. We continue to assume the notation and 
general context of that theorem. In particular, v and w lie in different bicomponents 
of the graph I’. 


Lemma 13 Let {0, uj, u2, v, w} be a set of five distinct points such that {u,, uz} is 
the only edge of E' that crosses {v, w}. Then ||v — w|| > 2. 


Proof We assume for a contradiction that ||v — w|| = 2. We consider four cases de- 
pending on lengths. 

Case 1: ||u—uj|| < 2.51 fori = 1,2 andu = v, w. By Lemma 11, there are circuits 
running through {u, v1, u2} for u = v, w. This is contrary to the assumption that v, w 
lie in different bicomponents of the graph J”. (In the remaining cases, there is no loss 
in generality to assume ||w — u2|| > 2.51.) 

Case 2: ||w — u2|| => 2.51, ||v — u1|| => 2.51. Geometric considerations give the 
contradiction ||u, — u2|| > 3.2. 

Case 3: ||w — u2|| > 2.51, ||v — u2|| > 2.51. Geometric considerations gives the 
contradiction ||u, — u2|| > 3.2. 

Case 4: ||w — u2|| > 2.51, ||v — u; || < 2.51 for i = 1, 2. The edge {uw 1, v2} cannot 
be a special edge of E’. Otherwise, v, w are corners of the same loop subregion. This 
contradicts the running assumption that these two vertices belong to separate bicom- 
ponents of the graph I’. By Lemma 10, there is no loop subregion along {u1, u2} 
on the v-side. Since {uw ,u2} has length greater than V8, there is a loop subregion 
L bounded by the edge, and it must then lie on the w-side. Thus, w is a corner of 
L, and the circuit of I” described by the boundary of L passes through w, uw, u2. 
By Lemma 11, there is a circuit of I through v, v1, v2. Hence, v, w lie in the same 
biconnected component, which is contrary to the running assumption. 


3CALC-743 1506800, CALC-5568465464, CALC-4741571261, CALC-6915275259. 
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9 Errata Listing 


The abridged version of the Kepler conjecture in the Annals [12] was generated by 
the same TeX files as the unabridged version in [23]. For this reason, it seems that 
every correction to the abridged version should also be a correction to the unabridged 
version. We list errata in the unabridged version. The same list applies to correspond- 
ing passages in the abridged version. 

Each correction gives its location in [23]. The location + counts down from the 
top of the page, or if a section or lemma number is provided, it counts from the top 
of that organizational unit. The location £ — n counts up from the bottom of the page. 
Footnotes are not included in the count from the bottom. Every line containing text 
of any sort is included in the count, including displayed equations, section headings, 
and so forth. The material to the left of ~» indicates original text, and material to 
the right of the arrow gives replacement text. The original text and replacement text 
appear in italic. Comments about the corrections appear in roman. 

In addition to the corrections to the text mentioned below, there have been some 
corrections to the computer code, including some typos in the listings of nonlinear 
inequalities. They are described in detail in [9]. 


9.1 Listing 


[p. 47, Lemma 5.16] Q ~» F 

[p. 49, €+ 2] supposed ~~» suppose 

[p. 63, Lemma 7.10] S-system ~» Q-system 

[p. 75, Remark 8.11] show ~ shows 

[p. 78, € — 7] constraints ~ constraint 

[p. 86, £+ 14] Let {0, v} be the diagonal of an upright quarter in the Qg ~» Let v be a vertex 
with 2tg < |v| < /8. Section 9 assumes that the diagonal belongs to a quarter in the Q-system, 
but Lemma 10.14 uses these results when {0, v} has 0 or 1 anchors. To make this coherent, we 
should assume throughout Sect. 9 that we have the weaker condition that whenever {0, v} has 
two or more anchors, it is a diagonal of a quarter in the Q-system. The proofs of Sect. 9 all go 
through in this context. (Lemma 9.7 is all that is relevant here.) 

[p. 87, Definition 9.3] In definition of A(v, W°), we can have some Q (as in Fig 9.1) with 
negative orientation. In this case, Ey M Ej; can clip the other side. We want the object without 
clipping. A(v, W®) should be understood as the unclipped object. 

[p. 88, Definition 9.6] The definition is poorly worded. First of all, it requires that the subscript 
to € to be a vertex, but then in the displayed equation, it makes w/2 the subscript, which is not 
a vertex. To define e’, move from w/2 along the ray through x’ until an edge of the Voronoi 
cell is encountered. If v, w, u are the three vertices defining that edge, then set en (A, x= Uu. 
Degenerate cases, such as when two different edges are encountered at the same time, can 
be resolved in any consistent fashion. In [21], these degeneracies are avoided altogether, by 
replacing functions €, «’ with sets &, &’. 

[p. 88, Lemma 9.7, £+ 2] w and v~ w and u 

[p. 88, L. 9.7, Claim 1] with |w — w'| < 2t9, and ~» with 

[p. 88, L. 9.7, €+5] Then: ~» Let Ri, = {x € Rw 1 (0, {u, w}) | eo (x, {u, w}) = uj. Assume 
that Rj, is not empty. Then: 

[p. 88, L. 9.7, Claim 3] Ry ~» Ri, 

[p. 89, 2+ 2] {w,v} ~ {w,u} 

[p. 92, £+ 16, €+ 21] max; uj; ~ max; |v; 
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[p. 93, € — 4] obstructed from w ~+ obstructed from w! 

[p. 93, € — 2] from some ~~» for some 

[p. 99, €+ 1] start ~» star 

[p. 105, Lemma 10.14] In the proof of the cases involving 0 or 1 anchor, a combination of the 
decompositions from Sects. 8.4 and 9 are used. These decompositions have not been shown to 
be compatible. Instead, it is better to combine A(v, W) with fg-truncation on the rest of the 
quad-cluster. With a fg truncation, we no longer have the nonpositivity results from Sect. 8. 
(The quoins give a positive contribution.) However, a routine calculation shows that the esti- 
mate on A(v, W) is sufficiently small so that we still obtain a constant less than —1.04 pt. 

{p. 116] [p. 121] Definition 11.7 allows masked flat in definition of 3-unconfined. Defini- 
tion 11.24 requires no masked flats in the same definition. Use Definition 11.24 (no masked 
flats), rather than 11.7. Where masked flats occur, treat them with Lemma 11.23, parts (1) 
and (2). 

[p. 116, 2+ 1] Lemma 4.16 ~» Lemma 4.17 

[p. 117, before Lemma 11.9] two others ~» three others 

[p. 117, Definition 11.8] yl ~ yy, 

[p. 119, Definition 11.5] By definition, we require a masked flat quarter to be a strict quarter. 
[p. 121] See p. 116. 

[p. 121, € — 5] 0.2274 ~» 0.02274 

[p. 123, flat case (2)] It is missing isolated quarters cut from the side. To fix this, in condition 
2), nas6 = V2 ~» nas6 = V2 or 1234 = V2 

[p. 124, Lemma 11.27] The bound of 0 is established in Theorem 8.4 ~» The bound of 0 for up- 
right quarters appears in Lemma 8.12. The bound of 0 for the other anchored simplex appears 
in Lemma 8.7 or 8.13, depending on the circumradius. 

[p. 126] Theorem 12.1 should include or(D) < sy, with s3 = 1 pt and s4 = 0, and tr(D) > 
3=0. 

[p. 131] Section 8 gives the deformation arguments that produce a biconnected graph. 

[p. 139, Lemma 12.18, proof, + 3] Co(\v|, 2) ~ Co (lvl, 2) 

[p. 139, Lemma 12.18, proof, @ + 6] TO(CH (2tg, )) — Tmax ~* T(Ch (2.2, 1)) — Tmax 

[p. 144, €+ 11, £+ 17] 242 ~» (219)? 

[p. 146] S, ~» of 3-crowded, 3-undefined, and 4-crowded combinations 

[p. 148, Sect. 13.6] This entire section is misplaced. It belongs with Sects. 25.5 and 25.6. 

[p. 149, before 13.7] the diagrams ~» Figs. 25.1—25.4 

[p. 149, p. 156] The definition of 5}g9p was accidentally dropped from the published version. 
Set djoop (4, 2) = 0.12034, djoop (5, 1) = 0.24939. These constants and their properties appear 
in the earlier 2002 arXiv preprint of the proof The Kepler conjecture (Sphere Packings VI). 

[p. 156, Lemma 13.5, € + 4] respectively for tr(D) ~» respectively, for op(D) and trR(D) 
[p. 164, £ — 1] This shows... occur. ~ This completes the proof. 

[p. 173, €+ 4] Insert the subscript on b, as in Proposition 15.5, starting on p. 173: b ~» bg. 
[p. 182, Lemma 16.7] The bound of 0 has not been shown to hold on each half. This is not 
a direct consequence of Theorem 8.4 as claimed. This can be fixed as follows. Let vj, v3 be 
the corners giving the endpoints of the long edge of the acute triangle at 0, and let v2, v4 
be the other two corners. If either vertex vj, v3 has height greater than 2.3, show that the 
vorg-scored quad cluster scores less than —1.04 pt. For this, we may use the deformations of 
Lemmas 12.10. The length of the diagonal along the acute face remains fixed and at least 
./8. We claim that these deformations produce a diagonal of length less than /8 between 
opposite corners of the quadrilateral. (If not, the deformations produce a rhombus with side 2 
and diagonals both greater than 8, which is a geometric impossibility.) We cut the quad 
cluster along the diagonal of length /8 and continue with deformations until the top edges 
on each simplex are (y4, v5, yo) = (2, 2, /8). We may apply CALC-474496219 and CALC- 
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8990938295 to the two separate simplices to obtain the result. Now we may assume that 
the heights of vj, v3 are at most 2.3. If either height is at least 2.1, the result follows from 
CALC-5127197465, which gives the bound of 0 on each half. 

Finally, we have the case where both heights are at most 2.1. We may apply dimension- 
reduction techniques so that the each of the two remaining comers v # v1, v3 of the quad 
cluster either has height 2 or has distance 2 from vj or v3. We then reprove Lemma 16.8 with- 
out using the bound of 0 and Lemma 16.9 for tetrahedra without the bound —1.04 pt. This 
appears in CALC-1551562505 and CALC-3013446042. 

If the dimension reduction drops the cross-diagonal {v2, v4} all the way to ./8, then we may 
swap diagonals and continue, until both diagonals are exactly \/8. In this case, by the cases 
already considered, we may assume that each corner has height at most 2.1. Also, geometric 
considerations give that the other edges are at most 2.02: 


E(2, 2,2, V8, 2.1, 2.1, 2,2, 2.02) > V8. 


The result follows in this case by CALC-4723770703. 

[p. 241] Mixed is defined so as to include the pure analytic case. In earlier articles, “mixed” 
excludes the pure analytic. mixed ~~ mixed or pure 

[p. 243, €+ 13, €+ 14, 2+ 15] Delete three sentences: ‘Let vj2 be ... We let ... Break the 
pentagon...” 

[p. 248, last displayed formula] = ~» + so that it reads 


S~ fr, (D) < & (Qj) + vorg’o(D) +7. 


U 


[p. 252, Sect. 25.7, Cases 2 and 3] “The flat quarter” is mentioned, but there are no flat quarters 
that have been introduced into the context. This passage has been displaced from its original 
context. 


[p. 254, £+ 7] to branch combine ~~ to combine 
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Part IV 


Initial Papers of the Hales Program 


a1] 


Introduction to Part IV 


Part IV presents two initial papers of T. C. Hales published in 1997, establishing 
his approach for proving the Kepler Conjecture. These are T. C. Hales, “Sphere 
Packings I” (Discrete Comput. Geom., 17 (1997), 1-51) and T. C. Hales, “Sphere 
Packings II” (Discrete Comput. Geom., 18 (1997), 135-149). 


The general approach outlined in these papers had to be modified in the eventual 
proof, as explained in the first of the six papers in Part II. The final proof was then 
rewritten to be logically independent of these papers. These papers contain crucial 
ideas suggesting the form of the final proof. 
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Sphere Packings I, by T. C. Hales 


This 1997 paper is Hales’s first paper on the Kepler Conjecture project, outlining his 
initial proof approach. 
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Sphere Packings, I 


T. C. Hales 
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hales@umich.edu 


Abstract. We describe a program to prove the Kepler conjecture on sphere packings. We 
then carry out the first step of this program. Each packing determines a decomposition of 
space into Delaunay simplices, which are grouped together into finite configurations called 
Delaunay stars. A score, which is related to the density of packings, is assigned to each 
Delaunay star. We conjecture that the score of every Delaunay star is at most the score of the 
stars in the face-centered cubic and hexagonal close packings. This conjecture implies the 
Kepler conjecture. To complete the first step of the program, we show that every Delaunay 
star that satisfies a certain regularity condition satisfies the conjecture. 


1. Introduction 


The Kepler conjecture asserts that no packing of spheres in three dimensions has density 
exceeding that of the face-centered cubic lattice packing. This density is 7//18 ~ 
0.74048. In an earlier paper [H2] we showed how to reduce the Kepler conjecture to 
a finite calculation. That paper also gave numerical evidence in support of the method 
and conjecture. This finite calculation is a series of optimization problems involving up 
to 53 spheres in an explicit compact region of Euclidean space. Computers have little 
difficulty in treating problems of this size numerically, but a naive attempt to make a 
thorough study of the possible arrangements of these spheres would quickly exhaust the 
world’s computer resources. 

The first purpose of this paper is to describe a program designed to give a rigorous proof 
of the Kepler conjecture. A sketch of a related program appears in [H2]. Although the 
approach of [H2] is based on substantial numerical evidence, some of the constructions of 
that paper are needlessly complicated. This paper streamlines some of those constructions 
and replaces others with constructions that are more amenable to rigorous methods. For 
this program to succeed, the original optimization problem must be partitioned into a 
series of much smaller problems that may be treated by current computer technology or 
hand calculation. 
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The second purpose of this paper is to carry out the first step of the proposed program. 
A statement of the result is contained in Theorem | below. 

Background to another approach to this problem is found in [H3]. To add more detail 
to the proposed program, we recall some constructions from earlier papers [H1], [H2]. 
Begin with a packing of nonoverlapping spheres of radius | in Euclidean three-space. 
The density of a packing is defined in [H1]. It is defined as a limit of the ratio of the 
volume of the unit balls in a large region of space to the volume of the large region. The 
density of the packing may be improved by adding spheres until there is no further room 
to do so. The resulting packing is said to be saturated. It has the property that no point 
in space has distance greater than 2 from the center of some sphere. 

Every saturated packing gives rise to a decomposition of space into simplices called the 
Delaunay decomposition. The vertices of each Delaunay simplex are centers of spheres 
of the packing. None of the centers of the spheres of the packing lie in the interior of 
the circumscribing sphere of any Delaunay simplex. In fact, this property is enough to 
determine the Delaunay decomposition completely except for certain degenerate pack- 
ings. A degeneracy occurs, for instance, when two Delaunay simplices have the same 
circumscribing sphere. In practice, these degeneracies are important, because they occur 
in the face-centered cubic and hexagonal close packings. The paper [H2] shows how to 
resolve the degeneracies by taking a small perturbation of the packing. In general, the 
Delaunay decomposition will depend on this perturbation. We refer to the centers of the 
packing as vertices, since the structure of the simplicial decomposition of space is our 
primary concern. For a proof that the Delaunay decomposition is a dissection of space 
into simplices, we refer the reader to [R]. 

The Delaunay decomposition is dual to the well-known Voronoi decomposition. If the 
vertices of the Delaunay simplices are in nondegenerate position, two vertices are joined 
by an edge exactly when the two corresponding Voronoi cells share a face, three vertices 
form a face exactly when the three Voronoi cells share an edge, and four vertices form 
a simplex exactly when the four corresponding Voronoi cells share a vertex. In other 
words, two vertices are joined by an edge if they lie on a sphere that does not contain any 
other of the vertices, and so forth (again assuming the vertices to be in nondegenerate 
position). The collection of all simplices that share a given vertex is called a Delaunay 
star. (This is a provisional definition: it is refined below.) 

Every Delaunay simplex has edges between 2 and 4 in length and, because of the 
saturation of the packing, a circumradius of at most 2. We assume that every simplex S 
in this paper comes with a fixed order on its edges, 1, ..., 6. The order on the edges is 
to be arranged so that the first, second, and third edges meet at a vertex. We may also 
assume that the edges numbered i andi + 3 are opposite edges fori = 1, 2, 3. We define 
S(y1, ..-, Ye) to be the (ordered) simplex whose ith edge has length y;. If S is a Delaunay 
simplex in a fixed Delaunay star, then it has a distinguished vertex, the vertex common 
to all simplices in the star. In this situation we assume that the edges are numbered so 
that the first, second, and third edges meet at the distinguished vertex. 

A function, known as the compression T'(S), is defined on the space of all Delaunay 
simplices. Let 54. = (—3a + 12 arecos(1/./3))//8 * 0.720903 be the density of a 
regular octahedron with edges of length 2. That is, place a unit ball at each vertex of the 
octahedron, and let 5,,; be the ratio of the volume of the part of the balls in the octahedron 
to the volume of the octahedron. Let S be a Delaunay simplex. Let B be the union of 
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four unit balls placed at each of the vertices of S. Define the compression as 
TCS) = —doe vol(S) + vol(S ON B). 


We extend the definition of compression to Delaunay stars D* by setting ['(D*) = 
>- T'(S), with the sum running over all the Delaunay simplices in the star. 

In this and subsequent work, we single out for special treatment the edges of length 
between 2 and 2.51. The constant 2.51 was determined experimentally to have a number 
of desirable properties. This constant appears throughout the paper. We call vertices that 
come within 2.51 of each other close neighbors. 

We say that the convex hull of four vertices is a quasi-regular tetrahedron (or simply a 
tetrahedron) if all four vertices are close neighbors of one another. Suppose that we have 
a configuration of six vertices in bijection with the vertices of an octahedron with the 
property that two vertices are close neighbors if and only if the corresponding vertices of 
the octahedron are adjacent. Suppose further that exactly one of the three diagonals has 
length at most 2/2. In this case we call the convex hull of the six vertices a quasi-regular 
octahedron (or simply an octahedron). 

The compatibility of quasi-regular tetrahedra and octahedra with the Delaunay de- 
composition is established in Section 3. We think of Euclidean space as the union of 
quasi-regular tetrahedra, octahedra, and various less-interesting Delaunay simplices. 
From now on, a Delaunay star is to be the collection of all quasi-regular tetrahedra, 
octahedra, and Delaunay simplices that share a common vertex v. This collection of 
Delaunay simplices and quasi-regular solids is often, but not always, the same as the 
objects called Delaunay stars in [H2]. We warn the reader of this shift in terminology. 

It is convenient to measure the compression in multiples of the compression of the reg- 
ular simplex of edge length 2. We define a point (abbreviated pt) to be '($(2,2,2,2,2,2)). 
We have pt = 11/3 — 12 arecos(1//3) © 0.0553736. 

One of the main purposes of this paper and its sequel is to replace the compression 
by a function (called the score) that has better properties than the compression. Further 
details on the definition of score appear in Section 2. We are now able to state the main 
theorem of this paper. 


Theorem 1. Jfa Delaunay star is composed entirely of quasi-regular tetrahedra, then 
its score is less than 8 pt. 


The idea of the proof is the following. Consider the unit sphere whose center is the 
center of the Delaunay star D*. The intersection of a simplex in D* with this unit sphere 
is a spherical triangle. For example, a regular tetrahedron with edges of length 2 gives 
a triangle on the unit sphere of arc length 77/3. The star D* gives a triangulation of the 
unit sphere. The restriction on the lengths of the edges of a quasi-regular tetrahedron 
constrains the triangles in the triangulation. We classify all triangulations that potentially 
come from a star scoring more than 8 pt. Section 5 develops a long list of properties that 
must be satisfied by the triangulation of a high-scoring star. 

It is then necessary to classify all the triangulations that possess the properties on this 
list. The original classification was carried out by a computer program, which generated 
all potential triangulations and checked them against the list. D. J. Muder has made a 
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significant improvement in the argument by giving a direct, computer-free classification. 
His result appears in the Appendix. 

As it turns out, there is only one triangulation that satisfies all of the properties on 
the list. Section 7 proves that Delaunay stars with this triangulation score less than 8 pf. 
This will complete the main thread of the argument. 

There are a number of estimates in this paper that are established by computer. These 
estimates are used throughout the paper, even though their proofs are not discussed until 
Sections 8 and 9. These sections may be viewed as a series of technical appendices 
giving explicit formulas for the compression, dihedral angles, solid angles, volumes, and 
other quantities that must be estimated. The final section states the inequalities and gives 
details about the computerized verification. There is no vicious circle here: the results 
of Sections 8 and 9 do not rely on anything from Sections 2-7. 

There are several functions of a Delaunay simplex that are used throughout this 
paper. The compression I'(S') has been defined above. The dihedral angle dih(S) is 
defined to be the dihedral angle of the simplex S along the first edge (with respect to 
the fixed order on the edges of S). Set dihmin = dih(S(2, 2, 2.51, 2,2, 2.51)) © 0.8639; 
dihmax = dih(S(2.51, 2,2, 2.51, 2, 2)) = arccos(—29003/96999) ~ 1.874444. We will 
see that dihmin and dihyax are lower and upper bounds on the dihedral angles of quasi- 
regular tetrahedra. The solid angle (measured in steradians) at the vertex joining the 
first, second, and third edges is denoted sol(S). The intersection of S with the ball of 
unit radius centered at this vertex has volume sol(S)/3 (see [H1, 2.1]). For example, 
sol(S(2, 2,2,2,2,2)) ~ 0.55. Let rad(S) be the circumradius of the simplex S. In 
Section 2 we define two other functions: vor(S), which is related to the volume of 
Voronoi cells, and the score o(S). Finally, let n(a, b, c) denote the circumradius of a 
triangle with edges a, b, c. Explicit formulas for all these functions appear in Section 8. 


2. The Program 


By proving Theorem 1, the main purpose of this paper will be achieved. Nevertheless, 
it might be helpful to give a series of comments about how Theorem | may be viewed 
as the solution to the first of a handful of optimization problems that would collectively 
provide a solution to the Kepler conjecture. 

We begin with some notation and terminology. We fix a Delaunay star D* about a 
vertex vo, which we take to be the origin, and we consider the unit sphere at vo. Let vj 
and v2 be vertices of D* such that vg, v;, and v2 are all close neighbors of one another. 
We take the radial projections p; of v; to the unit sphere with center at the origin and 
connect the points p; and p2 by a geodesic arc on the sphere. We mark all such arcs 
on the unit sphere. Lemma 3.10 shows that the arcs meet only at their endpoints. The 
closures of the connected components of the complement of these arcs are regions on the 
unit sphere, called the standard regions. We may remove the arcs that do not bound one 
of the regions. The resulting system of edges and regions is referred to as the standard 
decomposition of the unit sphere. 

Let C be the cone with vertex vp over one of the standard regions. The collection of 
the Delaunay simplices, quasi-regular tetrahedra, and quasi-regular octahedra of D* in 
C (together with the distinguished vertex vg) is called a standard cluster. Each Delau- 
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nay simplex in D* belongs to a unique standard cluster. Each triangle in the standard 
decomposition of the unit sphere is associated with a unique quasi-regular tetrahedron, 
and each tetrahedron determines a triangle in the standard decomposition (Lemma 3.7). 
We may identify quasi-regular tetrahedra with clusters over triangular regions. 

We assign a score to each standard cluster in [H4, 3]. In this section we define the 
score of a quasi-regular tetrahedron and describe the properties that the score should 
have in general. 

Let S be a quasi-regular tetrahedron. It is a standard cluster in a Delaunay star with 
center vo. If the circumradius of S is at most 1.41, then we define the score to be I'(S). 

If the circumradius is greater than 1.41, then embed the simplex S in Euclidean three- 
space. Partition Euclidean space into four infinite regions (infinite Voronoi cells) by 
associating with each vertex of S the points of space closest to that vertex. By intersecting 
S with each of the four regions, we partition S into four pieces Sos Si S, and &.. 
corresponding to its four vertices vp, Vv), V2, and v3. Let sol; be the solid angle at the vertex 
v; of the simplex. The expression —46,¢; vol(Sp) +4 solo/3 is an analytic function of the 
lengths of the edges for simplices S$ that contain their circumcenters. (Explicit formulas 
appear in Section 8.) This function may be analytically continued to a function of the 
lengths of the edges for simplices S' that do not necessarily contain their circumcenters. 
Let vor(S) be defined as the analytic continuation of —46,¢, vol(So) +4solg/3. 

In this case define the score of S to be vor($). Write o(S) for the score of a quasi- 
regular tetrahedron. In summary, the score is 


(Ss) = TS), if the circumradius of S is at most 1.41, 
~ | vor(S) otherwise. 


The quasi-regular tetrahedron S$ appears in four Delaunay stars. In the other three 
Delaunay stars, the distinguished vertices will be v;, v2, and v3, so that S, viewed as a 
standard cluster in the other Delaunay stars, will have scores —doc vol(S;) + sol; /3 (or 
their analytic continuations), fori = 1, 2, 3. By definition, 


3 
T'(S) =} (—8per vol(S;) + sol; /3). 
i=0 


The sum of the scores of $, for each of the four vertices of S$, is 4°($). This is the same 
total that is obtained by summing the compression of S at each of its vertices. This is the 
property we need to relate the score to the density of the packing. It means that although 
the score reapportions the compression among neighboring Delaunay stars, the average 
of the compression over a large region of space equals the average of the score over the 
same region, up to a negligible boundary term. 

The analytic continuation in the definition of vor($) has the following geometric 
interpretation. If the circumcenter of a quasi-regular tetrahedron S' is not contained in S, 
a small tip of the infinite Voronoi cell at vp (or some other vertex) will protrude through 
the opposite face of the Delaunay simplex. The volume of this small protruding tip is 
not counted in —46o¢; vol(So) + 4sol 9/3, but it is counted in the analytic continuation. 
The analytic continuations of the scores of S for each of the other three vertices acquire 
a term representing the negative volume of a part of the tip. The three parts together 
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constitute the entire tip, so that the negative volumes exactly offset the volume of the tip, 
and the sum of the four scores of S is 41'(S). Details appear in [H4]. 

The general definition of the score will have similar properties. To each standard 
cluster of a Delaunay star D* a score will be assigned. The rough idea is to let the score 
of a simplex in a cluster be the compression I'(S) is the circumradius of every face of 
S is small, and otherwise to let the score be defined by Voronoi cells (in a way that 
generalizes the definition for quasi-regular tetrahedra). 

The score o (D*) of a Delaunay star is defined as the sum of the scores of its standard 
clusters. The score has the following properties [H4, 3.1 and 3.5]: 


1. The score of a standard cluster depends only on the cluster, and not on the way it 
sits in a Delaunay star or in the Delaunay decomposition of space. 

2. The Delaunay stars of the face-centered cubic and hexagonal close packings score 
exactly 8 pt. 

3. The score is asymptotic to the compression over large regions of space. We make 
this more precise. Let A denote the vertices of a saturated packing. Let Ay denote 
the vertices inside the ball of radius N. (Fix a center.) Let D*(v) denote the 
Delaunay star at v € A. Then the score satisfies (in Landau’s notation) 


SY o(D*(v)) = Y“P(D*(v)) + O(N’). 
An 


Ay 


Lemma 2.1. Jf the score of every Delaunay star in a saturated packing is at most 
s < 16/3, then the density of the packing is at most 167 3o¢r/(16m — 3s). If the score 
of every Delaunay star in a packing is at most 8 pt, then the density of the packing is at 


most 1// 18. 


Proof. The second claim is the special case s = 8 pt. The proof relies on property 3. 
The number of vertices such that D*(v) meets the boundary of the ball By of radius N 
has order O(N’). Since the Delaunay stars give a fourfold cover of R*, we have 


4 (-b vol(By) + lanl) = 2 boe vol(D*(v)) + a) + O(N?) 
= )-r(D*(v)) + O(N’) 
An 
= Dlo(D*(v)) + OW”) 
< e + O(N’). 
Rearranging, we get 
4m|Anl 80a O(N?) 


3vol(By) — (1—3s/l67) ~ vol(By) 


In the limit the left-hand side is the density and the right-hand side is the bound. Similar 
arguments can be found in [H1] and [H2]. 
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The following conjecture is fundamental. By the lemma, this conjecture implies the 
Kepler conjecture. The lemma also shows that weaker bounds than 8 pt on the score 
might be used to give new upper bounds on the density of sphere packings. 


Conjecture 2.2. The score of every Delaunay star is at most 8 pt. 


The basic philosophy behind the approach of this paper is that quasi-regular tetrahedra 
are the only clusters that give a positive score, standard clusters over quadrilateral regions 
should be the only other clusters that may give a score of zero, and every other standard 
cluster should give a negative score. Moreover, we will prove that no quasi-regular 
tetrahedron gives more than | pt. 

Thus, heuristically, we try to obtain a high score by including as many triangular 
regions as possible. If we allow any other shape, preference should be given to quadri- 
laterals. Any other shape of region should be avoided if possible. If these other regions 
occur, they should be accompanied by additional triangular regions to compensate for the 
negative score of the region. We will see later in this paper that even triangular regions 
tend to give a low score unless they are arranged to give five triangles around each vertex. 

The main steps in a proof of the Kepler conjecture are 


1. A proof that even if all regions are triangular the total score is less than 8 pt. 

2. A proof that standard clusters in regions of more than three sides score at most 
0 pt. 

3. A proof that if all of the standard regions are triangles or quadrilaterals, then the 
total score is less than 8 pt (excluding the case of pentagonal prisms). 

4. A proof that if some standard region has more than four sides, then the star scores 
less than 8 pt. 

5. A proof that pentagonal prisms score less than 8 pt. 


The division of the problem into these steps is quite arbitrary. They were originally 
intended to be steps of roughly equal magnitude, although is has turned out that a con- 
struction in [H4] has made the second step substantially easier than the long calculations 
of the third step. 

This paper carries out the first step. The second step of the program is also complete 
[H4]. Partial results are known for the third step [H5]. In the fourth step it will be 
necessary to argue that these regions take up too much space, give too little in return, and 
have such strongly incompatible shapes that they cannot be part of a winning strategy. 

To make step 5 precise, we define pentagonal prisms to be Delaunay stars whose stan- 
dard decomposition has ten triangles and five quadrilaterals, with the five quadrilaterals 
in a band around the equator, capped on both ends by five triangles (Diagram 2.3). The 
conjecture in this section asserts, in particular, that pentagonal prisms, which created 
such difficulties in [H2], score less than 8 pt. The final step has been separated from the 
third step, because the estimates are expected to be more delicate for pentagonal prisms 
than for a general Delaunay star in the third step. 

One of the main shortcomings of the compression is that pentagonal prisms have 
compression greater than 8 pt (see [H2]). Numerical evidence suggests that the upper 
bound on the compression is attained by a pentagonal prism, denoted Dads in [H2], at 
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Diagram 2.3 


about 8.156 pt, and this means that the link between the compression and the Kepler 
conjecture is indirect. The score appears to correct this shortcoming. 

What evidence is there for the conjecture and program? They come as the result of 
extensive computer experimentation. I have checked the conjecture against much of the 
data obtained in the numerical studies of [H2, 9.3]. The data suggest that the score tends 
to give a dramatic improvement over the compression, often improving the bound by 
more than a point. The score of the particularly troublesome pentagonal prism Dip 
drops safely under 8 pt. I have checked a broad assortment of other pentagonal prisms 
and have found them all to score less than 8 pt. 

The second step shows that no serious pathologies can arise. The only way to form a 
Delaunay star with a positive score is by arranging a number of quasi-regular tetrahedra 
around a vertex (together with other standard clusters than can only lower the score). 
There must be at least eight tetrahedra to score 8 pt, and if there are any distortions in 
these tetrahedra, there must be at least nine. However, as this paper shows, too many 
quasi-regular tetrahedra in any star are also harmful. Future papers will impose additional 
limits on the structure of the optimal Delaunay star. 


3. Quasi-Regular Tetrahedra 


This section studies the compatibility of the Delaunay simplices and the quasi-regular 
solids. Fix three vertices v;, v2, and v3 that are close neighbors to one another. Let T be 
the triangle with vertices v;. It does not follow that T is the face of a Delaunay simplex. 
However, as we will see, when T is not the face of a simplex, the arrangement of the 
surrounding simplices is almost completely determined. 

If T is not the face of a Delaunay simplex, then we will show that there are two 
additional vertices vp and vj, where vp and vp are close neighbors to v;, v2, and v3. This 
means that there are two quasi-regular tetrahedra S; and Sz with vertices (vo, v1, V2, V3) 
and (uy, v1, V2, V3), respectively, that have the common face T (see Diagram 3.1(a)). We 
see that S, U Sz is the union of three Delaunay simplices with vertices (vo, ug, U1, V2), 
(v0, Up; V2, U3), and (vo, Up; v3, V1) in Diagram 3.1(b). This section establishes that this 
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is the only situation in which quasi-regular tetrahedra are not Delaunay simplices: they 
must come in pairs and their union must be three Delaunay simplices joined along a 
common edge. The decomposition of this paper is obtained by taking each such triple of 
Delaunay simplices (3.1(b)) and replacing the triple by a pair of quasi-regular tetrahedra 


(3.1(a)). 
(b) 


V3 


vi 


Wo 
Diagram 3.1 


From the dual perspective of Voronoi cells, the Voronoi cell at vp (or vj) would have 
a small tip protruding from S| through T, if the vertex vj were not present. The vertex 
Uo slices off this protruding tip so that the Voronoi cells at vo and vp have a small face in 
common. 


Lemma 3.2. Suppose that the circumradius of the triangle T is less that 2. Then T 
is the face of a Delaunay simplex. 


Proof. Letr < ./2 be the radius of the circle that circumscribes T, and let c be the 
center of the circle. The sphere of radius r at c does not contain any vertices of D* other 
than vj), v2, and v3. By the definition of the Delaunay decomposition (as described in the 
Introduction), this implies that T is the face of a simplex. 


Remark 3.3. We have several constraints on the edge lengths, if T is not a face of a 
Delaunay simplex. Consider the circumradius n(a, b, c) of a triangle whose edges have 
lengths a, b, c between 2 and 2.51. Since 2.5 12 < 27427, we see that the triangle is acute, 
so that n(a, b, c, ) is monotonically increasing in a, b, and c. This gives simple estimates 
relating the circumradius toa, b, andc. The circumradius is at most n(2.51, 2.51, 2.51) = 
2.51/./3 * 1.449. If the circumradius is at least /2 ~ 1.41421, then a, b, and c are 
greater than 2.3(n(2.3, 2.51, 2.51) © 1.41191 < /2). Under the same hypothesis, two 
of a, b, and c are greater than 2.41(7 (2.41, 2.41, 2.51) < af 2, Finally, at least one edge 
has length greater than 2.44(7 (2.44, 2.44, 2.44) < af 2: 


Lemma 3.4. Let T be the triangle with vertices v;. Assume that the vertices v; are close 
neighbors of one another. Suppose there is a vertex vo that lies closer to the circumcenter 
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of T than the vertices of T do. Then the vertex vo satisfies 2 < |vo — v;| < 2.15, for 
i = 1, 2,3. In particular, the convex hull of vo, . . . , v3 is a quasi-regular tetrahedron S 
with face T. 


Another way of stating the hypothesis on the circumcenter is to say that the plane 
of T separates vg from the circumcenter of S. Because of the constraints on the edge 
lengths in Remark 3.3, the three other faces of S are faces of Delaunay simplices. 


Proof. We defer the proof to Section 8.2.5. 


Lemma 3.5. Left v, vj), v2, v3, and v4 be distinct vertices with pairwise distances at 
least 2. Suppose that the pairs (v;, v;) are close neighbors for {i, j} € {1,4}. Then v 
does not lie in the convex hull of (v1, v2, v3, V4). 


Proof. For a contradiction, suppose v lies in the convex hull. Since |v — v;| > 2 and 
for {i, j} A {1,4}, v; and v; are close neighbors, the angle formed by v; and v; at 
the vertex v is at most 6) = arccos(1 — 2.517/8) ~ 1.357. Each such pair of vertices 
gives a geodesic arc of length at most 6 radians on the unit sphere centered at v. We 
obtain in this way a triangulation of the unit sphere by four triangles, two with edges of 
length at most 6) = 2 arcsin(2.51/4) radians, and two others that fit together to form a 
quadrilateral with edges of at most 69 radians. By the spherical law of cosines, the area 
formula for a spherical triangle, and [H2, 6.1], each of the first two triangles has area at 
most 3 arccos(cos 6)/(1 + cos @))) — a © 1.04. By the same lemma, the quadrilateral 
has area at most the area of a regular quadrilateral of side 9, or about 2.8. Since the 
combined area of the two triangles and quadrilateral is less than 477, they cannot give 
the desired triangulation. To see that v cannot lie on the boundary, it is enough to check 
that a triangle having two edges of lengths between 2 and 2.51 cannot contain a point 
that has distance 2 or more from each vertex. We leave this as an exercise. 


Corollary 3.5.1. No vertex of the packing is ever an interior point of a quasi-regular 
tetrahedron or octahedron. 


Proof. The corollary is immediate for a tetrahedron. For an octahedron, draw the dis- 
tinguished diagonal and apply the lemma to each of the four resulting simplices. O 


Let T be a triangle of circumradius between /2 and 2.51/./3, with edges of length 
between 2 and 2.51. Consider the line through the circumcenter of T, perpendicular to the 
plane of 7’. Let s be the finite segment in this line whose endpoints are the circumcenters 
of the two simplices with face T formed by placing an additional vertex at distance 2 
from the three vertices of T on either side of the plane through 7. 


Lemma 3.6. Let S be a simplex formed by the vertices of T and a fourth vertex vp. 
Suppose that the circumcenter of S lies on the segment s. Assume that vg has distance 
at least 2 from each of the vertices of T . Then vp has distance less than 2.3 from each of 
the vertices of T. 
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Proof. Let vj, v2, v3 be the vertices of T. For a contradiction, assume that Up has 
distance at least 2.3 from a vertex v; of 7. Lemma 3.4 shows that the plane through 
T does not separate vj from the circumcenter of S. If the circumcenter lies in s (and 
is not separated from vg by the plane through T), then by moving vg to decrease the 
circumradius, the circumcenter remains in s. 

Let S be the simplex with vertices uj, v1, v2, and v3. The circumcenter of S lies in 
the interior of S. We omit the proof, because it is established by methods similar to (but 
longer than) the proof of Lemma 3.4. Thus, the circumradius is increasing in the lengths 
of |v — v;|, for i = 1, 2, 3 (see paragraph 8.2.4). 

Let R be the circumradius of a simplex with face T and center an endpoint of s. We will 
prove that the circumradius of S is greater than R, contrary to our hypothesis. Moving uj 
to decrease the circumradius, we may take the distances to v; to be precisely 2.3, 2, and 2. 
We may move v1, v2, and v3 along their fixed circumscribing circle until |v2 — v3| = 2.51 
and |v; — v2| = |v; — v3| in a way that does not decrease any of the distances from vj 
to v;. Repeating the previous step, we may retain our assumption that vj has distances 
exactly 2.3, 2, and 2 from the vertices v; as before. We have reduced the problem to a 
one-dimensional family of tetrahedra parametrized by the radius r of the circumscribing 
circle of T. Set x(r) = |v, — v3| = |v2 — v3|. To obtain our desired contradiction, 
we must show that the circumradius R/(r) of the simplex $(2.3, 2, 2, 2.51, x(r), x(r)) 
satisfies R’(r) > R(r). Since both R’ and R are increasing inr, forr € eee 2.51//3], 
the desired inequality follows if we evaluate the 200 constants 


2.51 j 

R'(ri)— Rig) for: 7 = V2 Ga = 2) aa 

fori = 0,..., 199, and check that they are all positive. (The smallest is about 0.00005799, 
which occurs for i = 199.) O 


Let T be a triangle made up of three close neighbors. Suppose that T is not the face 
of a Delaunay simplex. There exists a vertex vg whose distance to the circumcenter of T 
is less than the circumradius of T. Let S be the quasi-regular tetrahedron formed by vo 
and the vertices of T. It is not a Delaunay simplex, so there exists a vertex vp that is less 
than the circumradius of S from the circumcenter of S. Let S’ be the simplex formed by 
vp and the vertices of T. It is not a Delaunay simplex either. The circumcenter of S lies 
in the segment s of Lemma 3.4, so the circumcenter of S’ does too. The lengths of the 
edges of S and S’ other than T are constrained by Lemmas 3.4 and 3.6. In particular, in 
light of Remark 3.3, the faces other than T of S and S’ are faces of Delaunay simplices. 

If vo and vp lie on the same side of the plane through T, then either vj lies in S, vo 
lies in S’, or the faces of S and S’ intersect nonsimplicially. None of these situations 
can occur because a nondegenerate Delaunay decomposition is a Euclidean simplicial 
complex and because of Lemma 3.5. We conclude that vp and vg lie on opposite sides 
of the plane through T. 

SUS‘ is bounded by Delaunay faces, so SU S’ is a union of Delaunay simplices. The 
fourth vertex of the Delaunay simplex in SUS’ with face (vo, v1, v2) cannot be v3 (S is not 
a Delaunay simplex), so it must be vp. Similarly, (vo, vp, v2, v3) and (vo, vg, V3, V1) are 
Delaunay simplices. These three Delaunay simplices cannot be quasi-regular tetrahedra 
by Lemma 8.3.2. 
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The assumption that T is not a face has completely determined the surrounding 
geometry: there are two quasi-regular tetrahedra S and S’ along T such that SU S’ isa 
union of three Delaunay simplices. 


Lemma 3.7. Let L be a union of standard regions. Suppose that the boundary of L 
consists of three edges. Then either L or its complement is a single triangle. 


For example, the interior of L cannot have the form of Diagram 3.8. Lemma 3.5 
(proof) shows that if all regions are triangles, then there are at least 12 triangles, so that 
the exterior of L cannot have the form of Diagram 3.8 either. 


Diagram 3.8 


Proof. Replacing L by its complement if necessary, we may assume that the area of 
L is less than its complement. The triangular boundary corresponds to four vertices ug 
(the origin), vj, v2, and v3. The close-neighbor constraints on the lengths show that the 
convex hull of vo, ..., v3 is a quasi-regular tetrahedron. By construction each quasi- 
regular tetrahedron is a single cluster. 


We say that a point v € R? is enclosed by a region on the unit sphere is the interior 
of the cone (with vertex vg) over that region contains v. For example, in Diagram 3.9, 
the point v is enclosed by the given spherical triangle. 


% 


Diagram 3.9 


The following lemma was used in Section 2 to define the standard decomposition. 
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Lemma 3.10. Fix a Delaunay star D* with center vo. Draw geodesic arcs on the unit 
sphere at vo for every triple of close neighbors vo, V1, V2 (as in Section 2). The resulting 
system of arcs do not meet except at endpoints. 


Proof. Our proof is based on the fact that a nondegenerate Delaunay decomposition is a 
Euclidean simplicial complex. Let 7, and T be two triangles made from two such triples 
of close neighbors. We have 7; C S; US}, where S; and S; are the Delaunay simplices 
with face 7; if 7; is the face of a Delaunay simplex, and they are the two quasi-regular 
tetrahedra with face T; constructed above, otherwise. Since a nondegenerate Delaunay 
decomposition is a Euclidean simplicial complex, S,; U S| meets S) U S simplicially. 
By the restrictions on the lengths of the edges in Lemmas 3.4 and 3.6, this forces T; to 
intersect 7, simplicially. The result follows. 


4. Quadrilaterals 


Fix a Delaunay star composed entirely of quasi-regular tetrahedra and consider the 
associated triangulation of the unit sphere. Let L be a region of the sphere bounded by 
four edges of the triangulation. L will be the union of two or more triangles. Replacing 
L by its complement in the unit sphere if necessary, we assume that the area of L is less 
than that of its complement. 

We claim that, in this context, L is the union of either two or four triangles, as 
illustrated in Diagram 4.1. In particular, L encloses at most one vertex. If a diagonal to 
the quadrilateral L is an edge of the triangulation, the region L is divided into two triangles 
each associated with a quasi-regular tetrahedron. In particular, there is no enclosed vertex. 
(Lemma 3.7 precludes any subtriangulation of a triangular region.) If, however, there is 
a single enclosed vertex and neither diagonal is an edge of the triangulation, then the 
only possible triangulation of L is the one of the diagram. Proposition 4.2 completes the 
proof of the claim. 


Diagram 4.1 


Proposition 4.2. A union of regions (of area less than 21) bounded by exactly four 
edges cannot enclose two vertices of distance at most 2.51 from the origin. 


This argument is somewhat delicate: if our parameter 2.51 had been set at 2.541, for 
instance, such an arrangement would exist. First we prove a useful reduction. 
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Lemma 4.3. Assume a figure exists with vectors v,,..., 04, v, and v' subject to the 
constraints 

2 < |u;| < 2.51, 

2 < lu — visqil Sk, 

2 < |v; — vj42I, 

24 |v=\, 

hi < |w—ujl, 

2 < |w| <2 for w=v,v and i=1,...,4 (mod4), 


where £, h;, and k; are fixed constants that satisfy € € [2.51, 2/2], h; € [2, 24/2), 
k; € [2,2.51]. Let L be the quadrilateral on the unit sphere with vertices v;/|v;| and 
edges running between consecutive vertices. Assume that v and v' lie in the cone at the 
origin obtained by scaling L. Then another figure exists made of a (new) collection of 


vectors V1, ..., V4, v, and v' subject to the constraints above together with the additional 
constraints 
lu; — Vigil = Kj, 
luj] = 2 for i=1,...,4, 
el shee 


Moreover, the quadrilateral L may be assumed to be convex. 


Proof. By rescaling v and v’, we may assume that |v| = |v’| = €. (Moving v or v’ 
away from the origin increases its distance from the other vertices of the configuration.) 

The diagonals satisfy |v2—v4| > 2.1 and |vj —v3| > 2.1. Otherwise, if say |v; — v3| < 
2.1, then the faces with vertices (v,, v2, v3) and (v1, v4, v3) have circumradius less than 
./2. The edge from 0 to v has length at most V2, so this edge cannot intersect these faces 
by the Euclidean simplicial complex argument used before Lemma 3.7 and in Lemma 
3.10. By Lemma 3.5, v cannot lie in the convex hull of (0, v1, v3, v;). This leaves v 
nowhere to go, and a figure with |v; — v3| < 2.1 does not exist. 

Next, we claim that we may assume that the quadrilateral L is convex (in the sense 
that it contains the geodesic arcs between any two points in the region). To see this, 
suppose the vertex v; lies in the cone over the convex hull of the other three vertices v;. 
Consider the plane P through the origin, v;-1, and vj. The reflection v; of v; through 
P is no closer to v, v’, or v;+2 and has the same distance to the origin, v;_;, and vj;+1. 
Thus, replacing v,; with v; if necessary, we may assume that L is convex. 

Most of the remaining deformations are described as pivots. We fix an axis and 
rotate a vertex around a circle centered along and perpendicular to the given axis. If, for 
example, |v3 — v4| < k3, we pivot the vertex v4 around the axis through 0 and v, until 
|v3 — v4| = k3. It follows from the choice of axes that the distances from v4 to the origin 
and v, are left unchanged, and it follows from the convexity of L that |w — v4| increases 
for w = v2, v3, v, and v’. Similarly, we may pivot vertices v; along the axis through the 
origin and vj; until |v; — vj—1| = &;, for all 7. 

Fix an axis through two opposite vertices (say v; and v3) and pivot another vertex 
(say v2) around the axis toward the origin. We wish to continue by picking different axes 
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and pivoting until |v;| = 2, fori = 1,2, 3,4. However, this process appears to break 
down in the event that a vertex v; has distance h; from one of the enclosed vertices and 
the pivot toward the origin moves v; closer to that enclosed vector. We must check that 
this situation can be avoided. 

Interchanging the roles of (v1, v3) with (v2, v4) as necessary, we continue to pivot 
until |v;| = |v3| = 2 or |v2| = |v4| = 2 (say the former). We claim that either vz has 
distance greater than h2 from v or that pivoting v2 around the axis through v; and v3 
moves v2 away from v. If not, we find that |v;| = |v3| = 2, |v — v2| = h2 and that v lies 
in the cone C = C(v2) with vertex v2 spanned by the vectors from v2 to the origin, v;, 
and v3. (This relies on the convexity of the region L.) 

To complete the proof, we show that this figure made from (0, v1, v2, v3, v) cannot 
exist. Contract the edge (v, v2) as much as possible keeping the triangle (0, v;, v3) fixed, 
subject to the constraints that v € C(v2) and |w — w’| > 2, for w = v, v2 and w’ = 0, 
v1, v3. This contraction gives |v — v2| < hy < 24/2. 


Case 1: v lies in the plane of (0, v2, v3). This gives an impossibility: crossing edges 
(v, V2), (0, v3) of length less than Daf, Similarly, v cannot lie in the plane of (0, v;, v2). 


Case 2: v lies in the interior of the cone C(v2). The contraction gives |v — w’| = 2 for 
w =v, v, and w’ = 0, vj, v3. The edge (v, v2) divides the convex hull of (0, v1, v2, v3, v) 
into three simplices. Consider the dihedral angles of these simplices along this edge. The 
dihedral angle of the simplex (vj, v2, v3, v) is less than 2. The dihedral angles of the 
other two are less than dih(S(2V2, 2,2,2,2,2)) = 2/2. Hence, the dihedral angles 
along the diagonal cannot sum to 27 and the figure does not exist. 


Case 3: v lies in the plane of (v2, v1, v3). Let r be the radius of the circle in this plane 
passing through v, and v3 obtained by intersecting the plane with a sphere of radius 2 at 
the origin. We have 2r > |v, — v3| > 2/2 because otherwise we have the impossible 
situation of crossing edges (v;, v3) and (v2, v) of length less than 2/2. Let H be the 
perpendicular bisector of the segment (v1, v3). By reflecting v through H if necessary 
we may assume that v and v2 lie on the same side of H, say the side of v3. Furthermore, 
by contracting (v, v2), we may assume without loss of generality that |v3 — v| = |v2 — 
v3| = 2. Let f(r) = |v — vol, as a function of r. f is increasing in r. The inequalities 
2/2 > hg > |v-—w| > f (/2) = 2/2 give the desired contradiction. 


Proof of Proposition 4.2. Assume for a contradiction that v and v’ are vertices enclosed 
by L. Let the center of the Delaunay star be at the origin, and let v,,..., v4, indexed 
consecutively, be the four vertices of the Delaunay star that determine the extreme points 
of L. 

We describe a sequence of deformations of the configuration (formed by the vertices 
V1,..., U4, U, U’) that transform the original configuration of vertices into particular rigid 
arrangements below. We show that these rigid arrangements cannot exist, and from this it 
follows that the original configuration does not exist either. These deformations preserve 
the constraints of the problem. To be explicit, we assume that 2 < |w| < 2.51, that 
2 < |w —»v;| if w ¥ vy, that 2 < |v — v’|, and that |v; — v;41| < 2.51, fori = 1,2, 3,4 
and w = vj,..., U4, v, v’. Here and elsewhere we take our subscripts modulo 4, so that 
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v, = vs, and so forth. The deformations also keep v and v’ in the cone at the origin that 
is determined by the vertices vj. 

We consider some deformations that increase |v — v’|. By Lemma 4.3, we may 
assume that |v;| = 2, |v] = |v’| = Ju; — vjai| = 2.51, fori = 1,2,3,4. If, for 
some i, we have |v; — v| > 2 and |v; — v’| > 2, then we fix v;_; and v;+2 and pivot 
vj41 around the axis through the origin and v;;2 away from v and v’. The constraints 


|vi41 — v;| = |v; — vj;-1| = 2.51 force us to drag v; to a new position on the sphere 
of radius 2. By making this pivot sufficiently small, we may assume that |v; — w| and 
|via1 — wl, for w = v, v’, are greater than 2. 


The vertices v and v’ cannot both have distance 2 from both v;,2 and v;_1, for then 
we would have v = v’. So one of them, say v, has distance exactly 2 from at most one of 
Uj42 and v;_1 (Say vj+2). Thus, v may be pivoted around the axis through the origin and 
vi42 away from v’. In this way, we increase |v — v’| until |v; — v| = 2 or |v; — v'| = 2, 
fori = 1,2,3,4. 

Suppose one of v, v’ (say v) has distance 2 from v;, v;+1, and v;+2. The configuration 
is completely rigid. By symmetry, the vertices v;_; and v’ must be the reflections of 
vi41 and v, respectively, through the plane through 0, v;, and v;+2. In particular, v’ has 
distance 2 from v;, v;-1, and v;42. We pick coordinates and evaluate the length |v — v’|. 
We find that |v — v’| ~ 1.746, contrary to the hypothesis that the centers of the spheres 
of our packing are separated by distances of at least 2. Thus, the hypothesis that v has 
distance 2 from three other vertices is incorrect. 


V3 


(a) (b) 


Diagram 4.4 


We are left with one of the configurations of Diagram 4.4. An edge is drawn in the 
diagram, when the distance between the two endpoints has the smallest possible value 
(that is, 2.51 for the four edges of the quadrilateral, and 2 for the remaining edges). Deform 
the figure of case (a) along the one remaining degree of freedom until |v — v’| = 2. In 
case (a), referring to the notation established by Diagram 4.5, we have a quadrilateral 
on the unit sphere of edges f; = 2 arcsin(2.51/4) © 1.357 radians, t) = arccos(2.51/4) 
radians, and #3 = 2 arcsin(1/2.51) radians. The form of this quadrilateral is determined 
by the angle q, and it is clear that the angle B(@) is decreasing in a. We have 


6 = dilmax = dih($(2.51, 2,2, 2.51, 2, 2)) © 1.874444. 
The figure exists if and only if there exists a such that (aw) + B(27 —0—a@) = 27 — 0. 
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By symmetry, we may assume that a < 7 — 0/2 © 2.20437. The condition |v — v,| > 2 
implies that 


COS fy — COS f2 COS £3 
a> arcos ( > 1.21. 


sin fo Sin £3 


However, by monotonicity, 
B(a) + BQnr —0—a) < B(a;) + BQnm —0-—a; — 0.1) < 207 - 8, 


fora; <a < 0.1+a;, witha; = 1.21+0.li,andi = 0,1,...,9,asadirect calculation 
of the constants B(@;) + B(2m — 0 — a; — 0.1) will reveal. (The largest constant, which 
is about 27 — 0 — 0.113, occurs for i = 0.) Hence, the figure of Diagram 4.4(a) does 
not exist. 


Diagram 4.5 


To rule out Diagram 4.4(b), we reflect v, if necessary, to its image through the plane 
P through (0, v1, v3), so that P separates v and v’. The vertex v can then be pivoted away 
from v’ along the axis through v; and v3. This decreases |v|, but we may rescale so that 
|v| = 2.51. Eventually |v — v2| = 2 or |v — v4| = 2. This is the previously considered 
case in which v has distance 2 from three of the vertices v;. This completes the proof 
that the original arrangement of two enclosed vertices does not exist. oO 


5. Restrictions 


If a Delaunay star D* is composed entirely of tetrahedra, then we obtain a triangulation 
of the unit sphere. As explained in Section 2, we wish to prove that no matter what the 
triangulation is, we always obtain a score less than 8 pt. In this section we make a long 
list of properties that a configuration must have if it is to have a score of 8 pt or more. 
The next section and the Appendix show that only one triangulation satisfies all of these 
properties. Additional arguments will show that this triangulation scores less than 8 pt. 
This will complete the proof of Theorem 1. 

In this section the term vertices refers to the vertices of the triangulation of the 
unit sphere. The edges of the triangulation give a planar graph. We adopt the standard 
terminology of graph theory to describe the triangulation. We speak of the degree of a 
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vertex, adjacent vertices, and so forth. The n triangles around a vertex are referred to as 
an n-gon. We also refer to the corresponding n tetrahedra that give the triangles of the 
polygon. We say that a triangulation contains a pattern (a), ..., d,), fora; € N, if there 
are distinct vertices v; of degrees a; that are pairwise nonadjacent, fori = 1,...,. Let 
N be the number of vertices in the triangulation, and let N; be the number of vertices of 
degree i. We have N = )- Nj. 

In this section we start to use our various inequalities from Section 9 related to the 
score. Since the score o (S) may be either (S$) or vor(S) depending on the circumradius 
of S, there are two cases to consider for every inequality. In general, the inequalities for 
I'(S) are more difficult to establish. In the following sections we only cite the inequalities 
pertaining to (S$). Section 9 shows how all the same inequalities hold for vor(S). 


Proposition 5.1. Consider a Delaunay star D* that is composed entirely of quasi- 
regular tetrahedra. Suppose that o (D*) > 8 pt. Then the following restrictions hold on 
the triangulation of the unit sphere given by the standard decomposition: 


1 13<N <15. 

2. N=Ny+N5+ No. 

3. A region bounded by three edges is either a single triangle or the complement of 
a single triangle. 

4. Two degree 4 vertices cannot be adjacent. 

5. N4 <2. 

6. Patterns (6, 6, 6) and (6, 6, 4) do not exist. 

7. The pattern (6, 6) or (6, 4, 4) implies that N > 14. 

8. If there are two adjacent degree 6 vertices, and a third degree 6 vertex adjacent to 
neither of the first two, then N = 15 and all other vertices are adjacent to at least 
one of these three. 

9. The triangulation is made of geodesic arcs on the sphere whose radian lengths are 
between 0.8 and 1.36. 


Lemma 5.2. Consider a vertex of degree n, for some 4 <n < 7. Let S\,..., Sy be the 
tetrahedra that give the n triangles. Then paar o(S;) is less than Z,, where z4 = 0.33 pt, 
Z5 = 4.52 pt, Zz = —1.52 pt, and z7 = —8.9 pt. Suppose that n > 6. Let S,..., S4 be 


any four of the n tetrahedra around the vertex. Then yo o(S;) < 1.5 pt. 


Proof. Whenn = 4, this is Lemma 9.5. When n = 5, this is Lemma 9.6. When n = 6 
orn = 7, we have, by Calculation 9.4, 


Y *o(S;) < > 5 (0.378979 dih(S;) — 0.410894) 
i=1 i=l 


= 27 (0.378979) — 0.410894n. 


The right-hand side evaluates to about — 1.520014 pt and —8.940403 pt, respectively, 
when n = 6 andn = 7. 

Assume that n > 6, and select any four S,,..., $4 of the n tetrahedra around the 
vertex. Let Ss and S¢ be two other tetrahedra around the vertex. The dihedral angles of 
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Ss and S¢ are at least dihnin, by Calculation 9.3. Each of the four triangles (associated 
with S;,..., 54) must then, on average, have an angle at most (27 — 2 dihmin)/4 at v. 
By Calculation 9.4, 


4 4 
CD) 2 Y °(0.378979 dih(S;) — 0.410894) 
i=l i=l 


< (20 — 2 dihpin)0.378979 + 4(—0.410894) < 1.5 pt. O 


Proof of Proposition 5.1. Lett = 2(N —2) be the number of triangles, an even number. 
By Euler’s theorem on polyhedra, 


3N3 + 2N4+ N5 +0No — N7--- = 12. 


Let S;,..., 5S, denote the tetrahedra of D*. Let o; = 0 (S;) be the corresponding score. 
Let sol; denote the solid angle cut out by S; at the origin. We have }~* sol; = 427. Often, 
without warning, we rearrange the indices i so that the tetrahedra that give the triangles 
around a given vertex are numbered consecutively. When a vertex v of the triangulation 
has been fixed, we let @; denote the angles of the triangles at v, so that a; = 27. The 
angle qa; of a triangle is equal to the corresponding dihedral angle of the simplex S;. We 
abbreviate certain sums over n elements to }°,,), when the context makes the indexing 
set clear. Throughout the argument, we use Calculation 9.1, which asserts that o; < | pt, 
for all i. The proofs show that if a triangulation fails to have any of properties 1-9, then 
the total score must be less than 8 pt. 
(Proof of 5.1.1.) Assume that tf > 28. By Calculation 9.9, 


Soo; < Y °(0.446634 sol; —0.190249) < 47 (0.446634) + +(—0.190249) 
(t) (t) 
< 4 (0.446634) + 28(—0.190249) < 8 pt. 


Suppose that t < 18. By Calculation 9.8, 


2 Oi < Y °(—0.37642101 sol; + 0.287389) = 47 (—0.37642101) + t(0.287389) 
(t) (t) 
< 42 (—0.37642101) + 18(0.287389) < 8 pt. 


This proves that 12 < N < 15. The case N = 12 is excluded after 5.1.2. 

(Proof of 5.1.2.) By Lemma 8.3.2 and Calculation 9.3, each angle a of each triangle 
satisfies dihmin < @ < dihmax. In particular, a > 2/4, so that each vertex has degree less 
than 8, and a < 27/3, so that each vertex has degree greater than 3. 

Assume for a contradiction that there is a vertex of degree 7. Consider the seven 
tetrahedra around the given vertex. By Lemma 5.2, the tetrahedra satisfy 


pee: < —8.9 pt. 
(7) 


Suppose t < 24. For each vertex v, set ¢, = )_,, o;, the sum running over the tetrahedra 
around v. Clearly, Yo 6; = (1, Sv) /3. Pick a vertex v that is not a vertex of any of the 
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seven triangles of the heptagon. By Lemma 5.2, we see that ¢, < 0.33 pt if v has degree 
4, ty < 4.52 pt if v has degree 5, ¢, < —1.52 pt if v has degree 6, and ¢, < —8.9 pt if 
v has degree 7. In particular, if v has degree n, then ¢, falls short of n points by at least 
0.48 pt. Thus, 


Soa; < doi + D> pt — 0.48 pt < (—8.9 + (24 — 7) — 0.48) pt < 8 pt. 
(t) (7) (t—7) 


Finally, we assume that f = 26 and N = 15. If Ne = Oand N7 = 1, then Euler’s theorem 
gives the incompatible conditions 2N4 + Ns — 1 = 12 and N4 + Ns + 1 = 15. Thus, 
No > Oor N7 > 1. This gives a second k-gon (k = 6 or 7) around a vertex v. This 
second polygon shares at most two triangles with the original heptagon. This leaves at 
least four triangles of a second polygon exterior to the first. Thus, by Lemma 5.2, 


Yoo s Doo + lo; + DO pt < -8.9 pt +15 pt + (26 — 11) pt < 8 pt. 
(26) (7) (4) (t—11) 


(Proof of 5.1.1, continued) Assume N = 12. We define three classes of quasi-regular 
tetrahedra. In the first class all the edges have lengths between 2 and 2.1. In the second 
class the fourth, fifth, and sixth edges have lengths greater than 2.1. The third class is 
everything else. 

Set ¢ = 0.001, a = —0.419351, b = 0.2856354. The following are established by 
Calculations 9.10—9.1 


oe 


o(S) < asol($)+b+e for S in the first class, 
o(S) < asol(S)+b for S in the second class, 
o(S$) < asol(S)+b—5e for S in the third class. 
Consider a vertex v of degree n = 4, 5, or 6 and the surrounding tetrahedra S;,..., Sy. 


We claim that 


(5.1.1.1) Y\o(S;) < ¥(asol(S;) +d). 
i=l i=l 
This follows directly from the stated inequalities if none of these tetrahedra are in the 
first class. It is also obvious if at least one of these tetrahedra is in the third class, because 
then the inequality is violated by at most —S5e + (n — l)e < 0. So assume that all of the 
tetrahedra are in the first two classes with at least one in the first class. By the restrictions 
on the lengths of the edges, a tetrahedron in the first class cannot be adjacent to one in 
the second class. We conclude that the tetrahedra are all in the first class. 
By Lemma 5.2, z, < n(0.904) pt. If or, sol($;) < n(0.56176), then 
Y°o(S;) < 00.904) pt < Y \(a(0.56176) + b) < (a sol(S;) + b). 
(n) (n) (n) 


So we may assume that oN sol(S;) => n(0.56176). By Calculation 9.13, 


Y\o(S;) < > 5(—0.65557 sol(S;) + 0.418) 
(n) (n) 
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< Yi@ sol(S;) + b) + Y °(0.132365 — 0.236219(0.56176)) 
(n) (n) 
< XC sol(S;) + b). 
(n) 
This establishes inequality (5.1.1.1). By averaging over every vertex, we see that the 
average of the scores o (S) is less than the average of a sol(S) + b. So 


o(D*) = > a(S) < xc sol(S) + b) = 41a + 20b © 7.99998 pt < 8 pt. 
(t) (t) 


(Proof of 5.1.3.) This is Lemma 3.7. (If we had not introduced quasi-regular tetrahedra, 
then this result would no longer hold.) 

(Proof of 5.1.4.) If two degree 4 vertices are adjacent, we then have the arrangement 
of Diagram 5.3. This is a quadrilateral enclosing two vertices, contrary to Proposition 
4.2. 


Diagram 5.3 


(Proof of 5.1.5.) As in the proof of 5.1.2, for each vertex v, set ¢, = », o;, the sum 
running over the tetrahedra around the vertex v. We use the estimates of ¢, that appear 
in Lemma 5.2. 

We have found that N; = 0, if i 4 4,5, 6. By Euler’s theorem, Ns = 12 — 2N4 and 
No = Na+ N — 12. Assume that N > 13 and that N4 > 3. Then 


Soo; = 49° < 100.334 + 4.52(12 — 2g) — 1.52(Ng + N — 12)) pt 
(t) (N) 


IA 


1(0,33(3) + 4.52(12 — 6) — 1.52(3 + 13 — 12)) pt < 8 pt. 


(Proof of 5.1.6.) Suppose that we have the pattern (6, 6, 6) Then reordering indices 
according to the polygons in the pattern, we have, by the estimates of Lemma 5.2, 


Vee yo) a a oo 
(t) (6) 


(6) (6) (1-18) 
< —1.52 pt — 1.52 pt — 1.52 pt + (26 — 18) pt < 3.5 pt. 


Similarly, if we have the pattern (6, 6, 4), then we find 


Yoo; < -1.52 pt — 1.52 pt + 0.33 pt + (26 — 16) pt < 8 pr. 
(t) 
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(Proof of 5.1.7.) We use the same method as in the proof of 5.1.6. If we have the 
pattern (6, 6), and if t < 22, then 


Yio < doo + > 00; + >> 0; < —1.52 pt — 1.52 pt + (22 — 12) pt < 8 pt. 
(t) (6) (6) (t—12) 


Similarly, if we have the pattern (6, 4, 4), then Yo o; is less than —1.52 pt + 0.33 pt + 
0.33 pt + 8 pt < 8 pt. 

(Proof of 5.1.8.) Let the two adjacent degree 6 vertices be v; and v2. Let the third be 
v3. The six triangles in the hexagon around v3 give less than —1.52 pt. The ten triangles 
in the hexagons around v,; and v2 give at most 


Yoo; < Yoo; < 1.5 pt — 1.52 pt < Opt, 
(4) (6) 


by the argument described in the case t = 26 of 5.1.2 (see Lemma 5.2). 

Suppose that t < 24. There remain at most 24 — 16 = 8 triangles, and they give 
a combined score of at most 8 pt. The total score is then less than (—1.52 + 8) pt, as 
desired. 

Now assume that t = 26. Suppose there is a vertex v that is not adjacent to any of v;, 
U2, or v3. As in the proof of 5.1.8, the ten triangles in the two overlapping hexagons give 
less than 0 pt. The other hexagon gives less than —1.52 pt. By Lemma 5.2, the n triangles 
around v fall short of n points by at least (5 — 4.52) pt = 0.48 pt. Each of the remaining 
triangles gives at most | pt. The score is then less than (O— 1.52 + (26 — 16) —0.48) pt = 
8 pt, as desired. 

(Proof of 5.1.9.) This follows directly from the construction of the triangulation and 
the close-neighbor restrictions on the lengths of the edges of a quasi-regular tetrahedron. 
The lengths are between 0.8 < 2arcsin(1/2.51) and 2 arcsin(2.51/4) < 1.36. This 
completes the proof of the proposition. 


6. Combinatorics 


Theorem 6.1. Suppose that a triangulation satisfies Proposition 4.2 and properties 
1-9 of Proposition 5.1. Then it must be the triangulation of Diagram 6.2 with 14 vertices 
and 24 triangles. 


Proof. Fix a polygon centered at a vertex uo, such as the hexagon in Diagram 6.3. The 
six vertices v1,..., V6 of the hexagon are distinct, for otherwise two distinct geodesic 


Diagram 6.2 


402 


Sphere Packings, I 23 


%G ‘i 
Of 
V4 V3 

Diagram 6.3 


arcs on the sphere would run between up and v; for some i. This is impossible, because uo 
and v; are not antipodal by Property 5.1.9. Similarly, the vertices of every other polygon 
of the triangulation are distinct. 

We then extend the polygon to a second layer of triangles. Each of the vertices v; has 
degree 4, 5, or 6, and two degree 4 vertices cannot be adjacent. The new vertices are 
denoted w ,..., wg. One example is shown in Diagram 6.4. 


Diagram 6.4 


There can be no identification of a vertex v; with a vertex w;, for otherwise there 
is a triangle (say with vertices wj, vg, Uo, vj = wy) that is subtriangulated, contrary to 
Property 5.1.3. Similarly, there is no identification of two vertices w; and wj, for otherwise 
it can be checked that there is a quadrilateral (with vertices w;, vx, Uo, Ue, Wi = Wi) 
that encloses more than one vertex, which is impossible by Proposition 4.2. A purely 
combinatorial problem remains. It is solved in the Appendix. 


Proposition 6.5. The triangulation of Theorem 6.1 scores less than 8 pt. 


Proof. Our initial bound on the score comes by viewing the triangulation as made up 
of two hexagons and twelve additional triangles. By Lemma 5.2 and Calculation 9.1, 


(6.6) Soo < doo + doo; + >) pt < -1.52 pt — 1.52 pt + 12 pt = 8.96 pr. 
(24) (6) (6) (12) 


A refinement is required in order to lower the upper bound to 8 prt. 
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If a vertex v has height |v| > 2.2, then, by Calculation 9.2, we find o; < 0.5 pt for 
the tetrahedra at v. Thus, in inequality (6.6), if the vertex v of some pentagon has height 
|v| > 2.2, then the term }°(,5) pt may be replaced by } (9) pf + )/ (3) 0.5 pt (there are 
three triangles in the pentagon that do not belong to the hexagon), and the upper bound 
on the score falls to 7.46 pt. 

If a vertex of degree 6 has height |v| > 2.05, then we claim that Yo 0; < —3.04 pt. 
In fact, by Calculation 9.7, the hexagon gives 


Yo; < > °(0.389195 dih(S;) — 0.435643) 
(6) (6) 
= 27 (0.389195) — 6(0.435643) < —3.04 pt. 


Estimate (6.6) is improved to 


Y\ oj < —3.04 pt — 1.52 pt + 12 pt = 7.44 pt. 
(24) 


If the twelve tetrahedra have combined solid angle less than 6.48, then, by Calculation 
9.9, 


Yio; < > (0.446634 sol(S;) — 0.190249) 
(12) (12) 
< 6.48(0.446634) + 12(—0.190249) < 11.039 pr. 


Then estimate (6.6) is improved to the bound — 1.52 pt— 1.52 pt+ 11.039 pt = 7.999 pt. 
Now assume, on the other hand, that the combined solid angle of the two hexagons is 

at most 47 — 6.48. Set K = (4m — 6.48) /12 and define o’(S) := o(S)+(K —sol(S))/3. 

The solid angle of one of the two hexagons is at most 6K. For that hexagon, we have 


P 6K — sol($;) 
Yo'(Si) = Droit ze > Va. 
(6) (6) 


(6) 


By our previous estimates, we now assume without loss of generality that the heights 
|v| of the vertices of triangles in the hexagon are at most 2.05, 2.2, and 2.2, the bound of 
2.05 occurring at the center of the hexagon. By Calculation 9.15, 


2 o'(S;) > 0.564978 dih; —0.614725 
(6) (6) 
2m (0.564978) + 6(—0.614725) < —2.5 pt. 


IA 


Estimate (6.6) becomes 


Yoo; < D2 0(S) + Yo + Doo; < (-2.5 — 1.52 + 12) pt = 7.98 pt. 


(24) (6) (6) (12) 


This completes the proof of the proposition. 
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7. The Method of Subdivision 


The rest of this paper is devoted to the verification of the inequalities that have been used 
in Sections 5 and 6. In this section we describe the method used to obtain several of our 
bounds. We call it the method of subdivision. Let p(x) = 5°, c;x! be a polynomial with 
real coefficients c;, where I = (i},..., in), X = (X1,...,X,) € R",andx! = x, ee 
It is clear that if C is the product of intervals [a;, b)] x --+ X [dn, bn] C R" in the positive 
orthant (a; > 0, fori = 1,..., 7), then 


Vx € C, Pmin(C) = p(x) s Pmax(C), 


where 


Pmin(C) = > cal + > cyb’ and Pmax(C) = y cb! + > cya’. 


c,>0 ce, <0 cy>0 c, <0 
Another bound comes from the Taylor polynomial p(x) = >> d;(x — a)! at a: 


(7.1) d+ Yi d(b—a)' <pa)sdt+ YD) aba)! 
d,<0,140 d;>0,140 


If r(x) = p(x)/q(x) is a rational function, and if gmin(C) > 0, then 
Pmin(C) Pmax(C) 


S71) = 3) 

q(C, p) q2(C, Pp) 
where qi(C, p) (resp. g2(C, p)) is defined as gmax(C) whenever Pmin(C) > O (resp. 
Pmax(C) < 0) and as gmin(C) otherwise. 

We define a cell to be a product of intervals in the positive orthant of R”. By covering 
a region with a sufficiently fine collection of cells, various inequalities of rational func- 
tions are easily established. To prove an inequality of rational functions with positive 
denominators (say 11(x) < r2(x), for all x € C), we cover C with a finite number of 
cells and compare the upper bound of r; (x) with the lower bound of r2(x) on each cell. 
If it turns out that some of the cells give too coarse a bound, then we subdivide each 
of the delinquent cells into a number of smaller cells and repeat the process. If at some 
stage we succeed in covering the original region C with cells on which the upper bound 
of 7; (x) is less than the lower bound of r(x), the inequality is established. 

A refinement of this approach applies the method to the partial derivatives. If, for 
instance, we establish by the method of subdivision that, for some /, 


Vx EC, 


ap 
Ox; 


(x) > 0, Vx EC, 


then we may compute an upper bound of p by applying the method of subdivision to the 
polynomial obtained from p by the specialization x; = b;, where b; is the upper bound 
of x; on C. Thus, we obtain an upper bound on a polynomial by fixing all the variables 
that are known to have partial derivatives of fixed sign and then applying the method 
subdivision to the resulting polynomial. Similar considerations apply to lower bounds 
and to rational functions with positive denominators. 
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It is a fortunate circumstance that many of the polynomials we encounter in sphere 
packings are quadratic in each variable with negative leading coefficient (u, 0, A, x in 
the next section). In this case the lower bound is attained at a corner of the cell. Of 
course, the maximum of a quadratic function with negative leading coefficient is also 
elementary: —a(x — xo)? + B < B,ifa > 0. 


8. Explicit Formulas for Compression, Volume, and Angle 


Many of the formulas in this section are classical. They can typically be found in 19th 
century primers on solid geometry. The formula for solid angles, for example, is due to 
Euler and Lagrange. For anyone equipped with symbolic algebra software, the verifi- 
cations are elementary, so we omit many of the details. All formulas in this section are 
valid for Delaunay simplices and for quasi-regular tetrahedra, unless otherwise noted. 


8.1. The Volume of a Simplex 
As in the previous section, a cell in R” is a product of intervals in IR”. We define a 
function A: [4, 16]© Cc R° > R by 
(8.1.1) A(X], ---,)%X6) = XyX4(—xXy + Xo + x3 — X4 + X5 + XG) 
+ x2x5(X1 — X2 +23 -+X4 — X5 + X6) 
+ X3X6(X1 + X2 — x3 +.X4 + X5 — X6) 


XIAX3X4 X{X3X5 X1{X2X6 X4AX5X6G- 


We set y; = ./x;, fori = 1,..., 6. This relationship between x; and y; remains in force 
to the end of the paper. Index the edges of a simplex as in Diagram 8.1.2. 


Diagram 8.1.2 


We also define u: [4, 16]? > R by 


(8.1.3) u(xX1, X2,X%6) = (1 + y2 + Yo) + Y2 — Yo) — Y2 + Yo) 
x (~y1 + y2 + Yo) 


= —x? _ cy _ ue + 2x1x6 + 2x1 x7 + 2x2X6. 
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Lemma 8.1.4. There exists a simplex of positive volume with edges of length y1,..., Ye 
if and only if A(x1,...,X6) > 0. If these conditions hold, then the simplex has volume 
W545 eis ey 12, 


Proof. The function u = u(x;, x2, x6) is quadratic in each variable, with negative 
leading coefficient, so the minimum of u, which is 0, is attained at a vertex of the cube 
[4, 16]. At the vertices where the minimum is attained, A(x, ..., x6) <0. 

Assume A > (0. Then u > 0, and a simplex exists with vertices 0, X = (1, 0, 0), 
Y= 5 (x, u'/?/y,,0), Z = (*, *, (A/u)'/). Conversely, if the simplex exists, then it 
must be of the given form, up to an orthogonal transformation, so u > 0 (otherwise O, 
X, and Y are colinear) and A > 0. The volume is |det(X, Y, Z)|/6 = Aer, 


Let C be acell contained in [4, 16]°. The minimum of A on C is attained at a vertex of 
C: this is clear because A is quadratic in each variable with negative leading coefficient. 
To obtain an upper bound on A on a cell, we may use the method of Section 7. The 
restriction of A to the zero set of dA /0dx, is 


U(X4, X53, XG) - U(X2, X3, X4) 


4x4 


This is an upper bound on A(x, ..., X6). 


8.2. The Circumradius of a Simplex 


The circumradius of a face with edges y4, ys, and yo is 

n(v4, Ys, Ye) Y4Y5 V6 
4, ¥51 6) = +5 9 2179" 

w(yg, 5+ Ye)? 


Define p: [4, 16]° > R by 
(8.2.1) 
P(xX%1,...,%) = —xix? _ awe — wex2 + 2x, x2x4x5 + 2x1x3X4X6 + 2X2X3X5X6. 


Lemma 8.2.2. Suppose A > 0; then p > 0. 


Proof. If A > 0, then the simplex of Lemma 8.1.4 exists. Let the coordinates of the 
circumcenter of the simplex be (x, y, z). Direct calculation shows that 0 < x*+y?+z? = 
p/(4A). 


Corollary 8.2.3. Let S be a Delaunay simplex of positive volume with edges of lengths 
yi = ./x;. The circumradius is 5(p/A)!/?, 


Set 


X(X1, X2,.X3,X4,X5,XG) = XpXgXs + Xp XoX4 + XXGXs + X2X4X5 + X5X3X6 


+ x3X4X6 — 2x5X6X4 — Hix4 _ waxes _ Meee: 
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We have 


p ee ces 

ZA 10% ¥51 Ye) = i CRE TN 
The vanishing of x is the condition for the circumcenter of the simplex to lie in the 
plane through the face T bounded by the fourth, fifth, and sixth edges. x is positive if 
the circumcenter of S and the vertex of S opposite T lie on the same side of the plane 
through T and negative if they lie on opposite sides of the plane. We say that T has 
positive orientation when x > 0. 


8.2.4. The function p is quadratic in each of the variables x,,..., x6 with negative 
leading coefficient. Thus, the minimum of p is attained at a vertex of a given cell 
C. The derivative is A*A(p/A)/dx\ = X(X5, X6, X1, X2, X3, X4) X (X1, X2, X3, X4, X5, XG). 
This leads to bounds on the circumradius by the method of Section 7. 

The circumradius of a quasi-regular tetrahedron is increasing in x1, ..., x6 if the ori- 
entation of each face is positive. When a face fails to have a positive orientation, it satisfies 
the constraints of Section 3 (for example, y;, yo, y3 € [2, 2.15], ya, ys, vo € [2.3, 2.51], 
nya, Ys, Yo) = /2, if it is the face opposite the origin). This allows us to determine 
bounds on the circumradius for most of the quasi-regular tetrahedra we encounter in this 
paper by inspection. For example, in Section 9 we study the simplices constrained by 
y; € [2, 2.1]. An upper bound on the circumradius is rad($(2.1, 2.1, 2.1, 2.1, 2.1, 2.1)). 


8.2.5. Proof of Lemma 3.4. In the notation of Section 3.4 let T be the given face, and 
let vo be a vertex satisfying the conditions of the lemma. Let S' by the simplex at the 
origin vg = 0, whose first, second, and third edges abut at v;, fori = 1, 2, 3. Suppose 
for a contradiction that |vg — v3|" = x3 > 2.157. In light of the results of this section, 
a contradiction follows if x(x1,...,%6) > 0, for x; > 4 fori = 1,2,x3 > 2.152, 
2.32 < x; < 2.51? fori = 4,5, 6. (The lower bound of 2.3 comes from Remarks 3.2 
and 3.3: the circumradius of T is at least /2). Since T is acute, 


ae = X4(—xX4 + x5 + x6) > O. 

Chal 
Similarly, x is increasing in x2 and x3. Thus, to minimize x, we take xj = x2 = 4, 
x3; = 2.157. However, x is quadratic in each variable x4, x5, and x6 with negative 
leading coefficient, so the minimum occurs at a corner point of [2.37, 2.517]>. We find 
x > x (27, 27, 2.157, 2.517, 2.517, 2.517) © 0.885 > 0. EI 


For the set of Delaunay stars to be a compact topological space, simplices of zero 
volume must be included. Since the circumradius remains bounded, the degenerate 
Delaunay simplices of zero volume are planar quadrilaterals that possess a circumscrib- 
ing circle. 


8.3. Dihedral Angles 


Let dih(S) be the dihedral angle of a simplex S along the first edge. It is the (interior) 
angle formed by faces with edges (2, 1, 6) and (1, 3, 5). 
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Lemma 8.3.1. 
dA/OX4 


cos dih(S) = ; 
(u(x1, x2, X6)U(X1, x3, X5))1/? 


where u is the function defined by (8.1.3). 


The partial derivative of cos dih(S) with respect to x3 is 


2x1(BA /8x2) 
U(X1, X2, X6)!/2u(x1, x3, x5)7/?’ 


so that the sign of the partial derivative is determined by the sign of 0A /0x2. Similar 
considerations apply to the partial derivatives with respect to x2, x5, and x¢. 


Lemma 8.3.2. Let S be a quasi-regular tetrahedron. The the dihedral angle of S along 
any edge is at most arccos(—29003/96999) = dihmax © 1.874444. 


Proof. Suppose that dih(S) > 2/2, so that the numerator n = 0A/dx4 in Lemma 8.3.1 
is negative. To bound cos dih(S) from below, we minimize u(x1, x2, X6), U(x1, X3, Xs), 
and the numerator n over the variables x2, x3, x5, and x6. 

We have 0ju = x; + x, — x; > 0 on the indicated domain, for i = 2, 3, 5, and 6, so 
that 


U(X1, X2, X6)U(X1, X3, X5) > u(xy, 2”, 27)? = (—x7 + 16x)’. 


Similarly, djn = x; + xj — x, > 0, fori = 2, 3,5, and 6, so that 


Wis saig Key = Hs 2, a, 2) = 16 — yp — Ox), 
Thus, 
16 — x; —2. 2; 
0 > cosdih(S) > ————* = 1 a 
16 — xX] 16 — x1 
2(2.51)? —29003 
~ 16 — 2.512 96999 
8.4. The Solid Angles of a Delaunay Simplex 
Let S be a Delaunay simplex with vertices vp, ..., v3. By the solid angle sol; (S$) of the 


simplex at the vertex i, we mean 3 vol(S B;), where B; is a unit ball centered at v;. For 
simplicity, suppose that vo is located at the origin, and that, in the notation of Diagram 
8.1.2, the vertices v1, v2, v3 are the edges of lengths y,, y2, and y3. We write sol for solo. 
Set 


(8.4.1) a(yi, y2,---, Ye) = Yi2¥3 + 4103 + Y3 — YG) 
+ hyo(yt + y3 — y3) + AysQyt + yz — Ye). 
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Lemma 8.4.2. 


2a 
sol($) = 2 arccot Ain |: 


Proof. (See [H2, p. 64].) We use the branch of arccot taking values in [0, zr]. 


The function a is increasing in y;, y2, and y3 on [2, 4] and is decreasing in the variables 
y4, ys, and ye on the same interval. 

We claim that a > 0 for any Delaunay simplex. We prove this in the case A > 0, 
leaving the degenerate case A = 0 to the reader. The area of the face T opposite the 
origin is at most 4/3 (since its edges are all at most 4). There exists a plane that is 
tangent to the unit sphere between this face and the unit sphere [H1, 2.1]. The area of 
the radial projection of T to this plane is less than the area of a disk D of radius 1.5 on 
the plane centered at the point of tangency. The solid angle (that is, the area of the radial 
projection of T to the unit sphere) is less than the area of the radial projection of D to 
the unit sphere). This area is 277 (1 — cos(arctan(1.5))) < a. Lemma 8.4.2 now gives the 
result. This allows us to use Lemma 8.4.2 in the form sol($) = 2 arctan(A!/?/(2a)). 


8.4.3. The solid angle is the area of a spherical triangle. Let x, y, and z be the cosines 
of the radian lengths of the edges of the triangle. By the spherical law of cosines, the 
solid angle, expressed as a function of x, y, and z is 


c(x, y, Z+e(y, Z, x) +c(Z, x, y)—-7, c(x, y, Z) = arccos ( is : 

eil=y 27) 
The partial derivative with respect to x of this expression for the solid angle is 
(-l—xt+y+z)/((x+1)/1), where t = 1 —?- y? —z*+2xyz. The second derivative 
of this expression evaluated at —1 —x + y+z = Ois —2(1 — y)(1 —z)t77/” < 0. So the 
unique critical point is always a local maximum. If neither edge at a vertex of a spherical 
triangle is constrained, then the triangle can be contracted or expanded by moving the 
vertex. If the lengths of the edges are constrained to lie in a product of intervals, then the 
minimum area occurs when two of the edges are as short as possible and the third is at 
one of the extremes. The maximum area is attained when two of the edges are as long 
as possible and the third is at one of the extremes or at a critical point: x = y+z-—1 
(or it symmetries, y=x+z—l1,z=x+y-1). 


8.5. The Compression of a Delaunay Simplex 


The compression I'(S) of a Delaunay simplex S is defined as 


4 * 
(8.5.1) P(S) = —8oer vol(S) + Zee) 


where 
—3n + 12 arecos(1/+/3) 


8.5.2 Soct = = 0.72. 
(8.5.2) 1 a 
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We let ao = a(y1, 2, Y3, Y4, Ys, Yo), Where a is the function defined in (8.4.1). We also 
let a), a2, and a3 be the functions a for the vertices denoted vj, v2, and v3 in Diagram 
8.1.2. For example, a, = a(y1, Ys, Yo. Ya, Y2, Y3). Set t = /A/2. By the results of 
Sections 8.1 and 8.4, we find that [(S) = —d,¢t/6 + y 5 arctan(t/a;). Recall that 
a; > Oonacell C. We wish to give an elementary upper bound of I’ on C. Set, for t > 0 
and a = (do, 41, 42, 43) € R4, 


—Soctt : t 
y(t, a) — a + Yj arctan (~ . 


The partial derivative of y with respect to a; is —2t /(3(t? + a?)) < 0, so y(t,a) < 
y(t,a~), where a~ = (dy , a; , 4, , a, ) is a lower bound of a on C, as determined by 
Section 8.4. We study y(t) = y(t, a_) as a function of ¢ to obtain an upper bound on 
I'(S). We note that 


ait 


3 
oct 2 aj ” 4 
+5 —— and 3 — <0 
6 5 aa) “ iL @+ap < 


Upper and lower bounds on f are known from Section 8.1. An upper bound on y for ¢ 
in [tmins tmax] is the maximum of (tin) and €(tmax), Where € is the tangent to y at any 
point in [tmin, tmax]. 


y'(t)= 


8.6. Voronoi cells 


We assume in this section that the simplex S has the property that the circumcenter of 
each of the three faces with vertex at the origin lies in the cone at the origin over the 
face. This condition is automatically satisfied if these three faces are acute triangles. In 
particular, it is satisfied for a quasi-regular tetrahedron. 

When the cone over a quasi-regular tetrahedron S$ contains the circumcenter of S, 
Section 2 sets vor($) = —46¢ vol(So) + 4so01(S)/3, where Sy is the intersection of S 
with a Voronoi cell at the origin. Otherwise, vor(S) is defined as an analytic continuation. 
This section gives formulas for vol(So). 

As usual, set S = S(y1,..., Yo) and x; = yi Suppose at first that the circumcenter 
of S is contained in the cone over S. The polyhedron So breaks into six pieces, called the 
Rogers simplices. A Rogers simplex is the convex hull of the origin, the midpoint of an 
edge (the first, second, or third edge), the circumcenter of a face along the given edge, 
and the circumcenter of S. Each Rogers simplex has the form 


R= RQ, b,o) = SQ, b.6, =P)" 2 = 2)", 6 =a") 


for some | < a < b < c. Here a is the half-length of an edge, b is the circumradius of a 
face, and c is the circumradius of the Delaunay simplex. 

The volume is vol(R(a, b, c)) = a(b? — a?)'/?(c? — b?)!/?/6. The density 5(a, b, c) 
of R = R(a, b,c) is defined as the ratio of the volume of the intersection of R with a 
unit ball at the origin to the volume of R. It follows from the definitions that 


(8.6.1) vor($) = y°4 vol(R(a, b, c))(—doer + (a, b, c)), 
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where c is the circumradius rad(S), and (a, b) runs over the six pairs 


Yl y2 

(=. nO1, Y25 ve)) ‘ (= n(1, Y2; ve)) , 
y2 ¥3 

(| n(y2, Y3, v4)) ; ee n(y2, Y3; ya) ; 


3 y 
(3.103, 91.99), (3.103. 91.99) 


Upper and lower bounds on vol(R(a, b, c)) follow without difficult from upper and lower 
bounds on a, b, and c. Thus, an upper bound on 4(a, b, c) leads to an upper bound on 
vor($). The next lemma, which is due to Rogers, gives a good upper bound [R]. 


Lemma 8.6.2. The density 5(a, b, c) is monotonically decreasing in each variable for 
l<a<b<ce. 


Proof. Letl <a <b) < c1,1 < a@ < bo < €2,a@ <q, b2 < bi, C2 < c1. The 
points of R(a,, b;, c,) are realized geometrically by linear combinations 


$1 = A(a1, 0, 0) + Ao(a1, (67 — a?)/?, 0) + As(ai, (Bb? — a2)”, (ct — B?)), 


where A;,A2,A3 > O and Ay +Az2+A3 < 1. The points of R(a2, bo, cz) are realized 
geometrically by linear combinations 


8) = Aj (a2, 0, 0) + Az(ap, (Bj — a5)", 0) + As (a2, (B5 — a3)", (c5 — B3)"), 
with the same restrictions on 4;. Then 
|si|? — |Sol” = Ar(Ar + 2A2 + 2A3)(aj — a3) + Aa(A2 + 2A3)(b} — b3) + 23 (Cf — c5). 


So |s;|? > |s2|?. This means that the linear transformation s; +> s> that carries the 
simplex S; to Sy moves points of the simplex S closer to the origin. In particular, the 
linear transformation carries the part in S; of the unit ball at the origin into the unit ball. 
This means that the density of R(a, b1, c,) is at most that of R(az, bo, c2). 


If the circumcenter of S$ is not in the cone over S, then the analytic continuation gives 


vor(S) = » Aer vol(R(a, b, c))(—Socr + 8(a, b, c)), 
R 


where €r = | if the face of the Delaunay simplex S corresponding to R has positive 
orientation, and ¢z = —1 otherwise. (The face of S “corresponding” to R(a, b, c) is the 
one used to compute the circumradius b.) 


8.6.3. A calculation based on the explicit coordinates of S$ and its circumcenter given 
in Lemma 8.1.4 shows that 


X 1 (%2 + X6 — X1)X (X4, X5, XZ, X1, X2, XG) 
48u(x1, 2X2, X6)A(X1,..., X6)!/? 


en vol (R (2, n(v1, ya, yo), tad(S))) = 
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If the circumcenter of S is not contained in S, then the same formula holds by analytic 
continuation. By definition, ¢g = —1 exactly when the function x is negative. Although 
this formula is more explicit than the earlier formula, it tends to give weaker estimates 
of vor(S) and was not used in the calculations in Section 9. 


8.6.4. There is another approximation to vor(S) that will be useful. Set S, = 
S(2, 2,2, y, y, y). (We hope there is no confusion with the previous notation $;.) For 
1 <a <b <c, let vol(R(a,b,c)) = a((b? — a’*)(c? — b*))'/? /6 be as above. Set 
r(a) = vol(R(a, n(2, 2, 2a), 1.41)). 


Lemma 8.6.5. Assume that the circumradius of a quasi-regular tetrahedron S is at 
least 1.41, and that 6 < y; + yo + y3 < 6.3. Seta = (y1 + y2 + y3 — 4)/2. Pick y to 
satisfy sol(Sy) = sol(S). Then 


vor(S) < vor(Sy) — 88ocr (: - =) r(a) 


a 


Proof. If S is any quasi-regular tetrahedron, let Sa, be the simplex defining the “tan- 
gent” Voronoi cell, that is, S,,, 1s the simplex with the same origin that cuts out the same 
spherical triangle as S on the unit sphere, but that satisfies yj) = y2 = y3 = 2. The lengths 
of the fourth, fifth, and sixth edges of Sian are between 4/8 — 2(2.3) = /3.4 and 2.51. 
The faces of S,an are acute triangles. A calculation similar to the proof in paragraph 8.2.5, 
based on x (3.4, 3.4, 4, 4, 4, 2.5 12) > 0, shows that the circumcenter of Stan is contained 
in the cone over Stan. 

Since Stan is obtained by “truncating” S, we observe that vor(Stan) — 46cr vol(§ Sica), 
where $ and Si are the pieces of Voronoi cells denoted A in Section 2 for S and Stan, 
respectively. By a convexity result of Fejes Toth, vor(Stan) < vor(S,) [FT, p. 125]. (This 
inequality relies on the fact that the circumcenter of Sta, is contained in the cone over 
Stan-) 

Let a, be the half-length of the first, second, or third edge. Since vol(R(a,, b, c)) is 
increasing in c, we obtain a lower bound on vol(R (a1, b, c)) forc = 1.41, the lower bound 
on the circumradius of S. The function vol(R(a,, b, 1.41)), considered as a function of 
b, has at most one critical point in [a;, 1.41] and it is always a local maximum (by a 
second derivative test). Thus, 


(8.6.6) vol(R(a1, b, c)) => min(vol(R (a1, Dmin, 1.41)), vol(R(a1, bmax, 1.41))), 


where Dyin and Dmax are upper and lower bounds on b € [a;, 1.41]. A lower bound on 
b is n(2, 2, 2a,), which means that vol(R(q1, Dmin, 1.41)) may be replaced with r(a;) 
in inequality (8.6.6). By Heron’s formula, the function 7(a, b, c), for acute triangles, is 
convex in pairs of variables: 
> na? +b — (a —b? +.?)(-a +b? +.?)(a? +b? +c’) r 

Naa Nbb Nab = gob 4 > VU. 
Thus, an upper bound on Dmax is 7(2, 2.51, 2a), where a = (y; + y2 + y3 — 4)/2. This 
means that 


vol(R((a1, Dax, 1.41))) 
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may be replaced with the function vol(R(qa1, 7(2, 2.51, 2a), 1.41)) in inequality (8.6.6). 
Now | < a < a, and vol(R(a,, b, c)) is decreasing in the first variable (for 1 < a; and 
b< af 2): so we may use the lower bound 


vol(R(a, (2, 2.51, 2a), 1.41)) 


instead. By Calculations 9.20.2 and 9.20.3, we conclude that vol(R(a1, b,c)) => r(a). 
This lower bound is valid for each Rogers simplex. The volume of $\ Stan is then at least 


8 8 8 
Yj V3 ¥3 
The concavity of 1 — 8/y? gives )-?_,(1 — 8/y?) => 1 — 1/a°. We have established that 


vol($\ Stan) > (1 os =) 2r(a). 


a 


The result follows. 


8.6.7. We conclude our discussion of Voronoi cells with a few additional comments 
about the case in which analytic continuation is used to define vor(S), with S a quasi- 
regular tetrahedron. Assume the circumcenter c of S lies outside S and that the face T 
of S with negative orientation is the one bounded by the first, second, and sixth edges. It 
follows from Section 3 that y,, y2, ye € [2.3, 2.51] and y3, y4, ys € [2, 2.15]. 


Let p, (resp. pz) be the point on T equidistant from the origin, v3, and v; (resp. v2). 


ipa xu (x1, X2, X6)(—x1 + x3 + x5)? 
ae A(OA /dxay? 


Let po be the circumcenter of T (see Diagram 8.6.8). 


Mi 


0 


Diagram 8.6.8 


€r = —1 for the Rogers simplex with vertices the origin, po, c, and v;/2. It lies 
outside S. The other Rogers simplex along the first edge has er = 1, so that the part 
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common to both of these Rogers simplices cancels in the definition of vor(S). This means 
that vor(S) becomes the sum of the usual contributions from the two Rogers simplices 
along the third edge, 


4vol(R(a, b, c))(—89er + 6(a, b, c)) 


for (a, b, c) = (y1/2, n(V1, 3, Ys), |Pil) and (y2/2, n(y2, ¥3, Ya), |p2|), and 
v(S”) := 485, vol(S”) — | 
where S” is the convex hull of 0, po, p1, p2, and c. 

We claim that in any cell satisfying the constraints given above, |p| is minimized 
by making yj, y2, yo as large as possible and y3, y4, ys as small as possible. To show 
this, the derivatives have to be written explicitly from the formula for | p;|? given above. 
The partial derivatives with respect to x2, ..., x6 factor into products of the polynomials 
0A /0x;, fori = 3,4,5, x1, (43 +xX5 — 2X1), U(X1, X2, X6), (HX 1X4 + X2X5 +.X1X6 — X5X6), 
and (x1x2 — X2x3 — x1x4 + x3X6). The signs of these polynomials are easily determined. 
The partial with respect to x; is complicated (the numerator has 88 terms), and we had 
to resort to the method of subdivision to determine its sign. We omit the details. 

To complete our estimate, we describe an upper bound on v(S”). The ratio sol($”)/ 
(3 vol(S”)) is at least 1/rad(S)*, because S$” is contained in a sphere of radius rad(S), 
centered at the origin. We have 


vol(S”) 
2 


because the convex hull of 0, po, pi, and c, which is one side of S”, is a pyramid with 
base the right triangle (p;, po, c) and height at most |po| = (1, y2, yo). Of course, 
|c— pol? = rad(S)*— nV, Y2, ye). We now have a bound on v(S”) in terms of quantities 
that have been studied in Section 8, if we rely on the bound | pp — p;| < 0.1381. Write 
Pi = piQ1,---, Yo). This bound is obtained from the following inequalities: 


2 ae ( 2 ae 
n(V1, Y2, Yo) ri Ipil q 
x] 


1/2 oer 
(101, 2.51, 2.517 — 2) © — (Ipi1,2-51,2,2,2,2.5DP — 2) 


< glpo — pille — pollpol. 


|Po— pil = 


IA 


IA 


as" 
(nasi, 251, 951)= =) 


2.512\ 1? 
= (1u251,2.51,2, 20 95 = ) 


< 0.1381. 


The inequality that replaces y; with 2.51 results from Calculation 9.21. 


8.7. A Final Reduction 


Let S be a Delaunay simplex. Suppose that the lengths of the edges y;, ys, and ye are 
greater than 2. Let S’ be a simplex formed by contracting the vertex joining edges 1, 5, 
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and 6 along the first edge by a small amount. We assume that the lengths y), ys, and y¢ 
of the new edges are still at least 2 and that the circumradius of S’ is at most 2, so that 
S’ is a Delaunay simplex. 


Proposition 8.7.1. [(S’) > P(S). 


We write sol;, fori = 1, 2, 3, for the solid angles at the three vertices p;, p2, and p3 
of S terminating the edges 1, 2, and 3. Let p{ be the vertex terminating edge 1 of S’. 
Similarly, we write sol;, fori = 1, 2, 3, for the solid angles at the corresponding vertices 
of S’. We set vol(V) = vol(S) — vol(S’) and w; = sol; — sol’. It follows directly from 
the construction of S’ that w2 and w3 are positive. The dihedral angle a along the first 
edge is the same for S and S’. The angle £; of the triangle (0, p1, p;) at p; is less than 
the angle f; of the triangle (0, p}, p;) at p|, fori = 2, 3. It follows that w is negative, 
since —w is the area of the quadrilateral region of Diagram 8.7.2 on the unit sphere. 


PY & -B 
N} B, 

P2 Ww 
s' 
Y 


Diagram 8.7.2 


Lemma 8.7.3. doc; vol(V) > w2/3 + w3/3. 


The lemma immediately implies the proposition because w; < 0 and 


Pisi-Trisje— "= = ge ev) 
= 3 3 3 oct - 


Proof. Let T’ be the face of S’ with vertices p'|, p2, and p3. We consider S’ as a function 
of t, where f is the distance from p; to the plane containing 7’. (See Diagram 8.7.2.) It 
is enough to establish the lemma for f infinitesimal. 

As shown in Diagram 8.7.2, let V; be the pyramid formed by intersecting V with the 
plane through p; that meets the fifth edge at distance % = 1.15 from p3 and the sixth 
edge at distance fg from p2. Also, let the intersection of V with a ball of radius fg centered 
at p; be denoted V;, fori = 2, 3. For ¢t sufficiently small, the region Vj is (essentially) 
disjoint from V2 and V3. 

We claim that vol(V,) > vol(V2 V3). Let 6 be the angle of T’ subtended by the fifth 
and sixth edges of S’. Then vol(V,) = Br/3, where B is the area of the intersection of 
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V, and T’: 


sin 6 (ys — t0)(¥6 — to) 
5 . 
Since S’ is a Delaunay simplex, the estimates 17/6 < 6 < 27/3 from [H1, 2.3] hold. In 
particular, sind > 0.5, so B > 0.25(2 — 1.15)”, and vol(V;) > 0.06r. 
If vol(V2 M V3) is nonempty, the fourth edge of S must have length less than 279. The 
dihedral angle a’ of V along the fourth edge is then less than the tangent of the angle, 
which is at most ¢ + O(t7), in Landau’s notation. As in [H1, 5], we obtain the estimate 


_ (t — 1)? tp + Dea’ 
7 3 


(8.7.4) Re 


< 0.025t + O(t?). 


to 
vol(V2M V3) < wf te —t? dt 
1 


This establishes the claim. 
Thus, for ¢ sufficiently small 


Soct VOI(V) = Socr (vol(V1) + vol(V2) + vol(V3) — vol(V2M V3)) 
(M422) core 
> Boatd) (5 +5 > 1.09 (5 ++). o 


9. Floating-Point Calculations 


This section describes various inequalities that have been established by the method of 
subdivision on SUN workstations. The full source code (in C++) for these calculations 
is available [H6]. 

Floating-point operations on computers are subject to round-off errors, making many 
machine computations unreliable. Methods of interval arithmetic give users control over 
round-off errors [AH]. These methods may be reliably implemented on machines that al- 
low arithmetic with directed rounding, for example, those conforming to the IEEE/ANSI 
standard 754 [W], [IEEE], [P]. 

Interval arithmetic produces an interval in the real line that is guaranteed to contain the 
result of an arithmetic operation. As the round-off errors accumulate, the interval grows 
wider, and the correct answer remains trapped in the interval. Apart from the risk of 
compiler errors and defective hardware, a bound established by interval arithmetic is as 
reliable as a result established by integer arithmetic on a computer. We have used interval 
arithmetic wherever computer precision is a potential issue (Calculations 9.1—9.19, in 
particular). 

Every inequality of this section has been reduced to a finite number of inequalities of 
the form r(xo) < 0, where r is a rational function of x € R” and x9 is a given element in 
R". To evaluate each rational expression, interval arithmetic is used to obtain an interval 
Y containing r(xo). The stronger inequality, y < 0 for all y € Y, which may be verified 
by computer, implies that r(xo) < 0. 

To reduce the calculations to rational expressions r(xq), rational approximations to 
the functions ./x, arctan(x), and arccos(x) with explicit error bounds are required. 
These were obtained from [H7]. Reliable approximations to various constants (such as 
mz, V2, and docr) with explicit error bounds are also required. These were obtained in 
Mathematica and were double checked against Maple. 
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Let S = S(y1,..., ye) be a quasi-regular tetrahedron. We label the indices as in 
Diagram 8.1.2. Let F = V'(S(j1,..-., y6)) be the compression. We also let dih = 
dih(S(y;,..., ¥6)) be the dihedral angle along the first edge, and let sol = 
sol(S(y1,..., ¥6)) be the solid angle at the origin, that is, the solid angle formed by 


the first, second, and third edges (1, y2, y3) of S. 

All of the following inequalities are to be considered as inequalities of analytic func- 
tions of yj,..., y6. Although each of the calculations is expressed as an inequality 
between functions of six variables, Proposition 8.7.1 has been invoked repeatedly to 
reduce the number of variables to three or four. For instance, suppose that we wish 
to establish (sol, I") < 0, where J (sol, I) is an expression in P'(S(y,..., y6)) and 
sol(S(y1, ..., ¥6)). Invoking Proposition 8.7.1 three times, we may assume that the ver- 
tices marked vj, v2, and v3 in Diagram 8.1.2 each terminate an edge of minimal length. 
It is then sufficient to establish the inequality in seven situations of smaller dimension; 
that is, we may assume the edges i € J have minimal length, where / is one of 


{1, 4}, {2, 5}, {3, 6}, {4, 5}, {4, 6}, {5, 6}, {1, 2, 3}. 
Similarly, for an inequality in I and dihedral angle, we reduce to the seven cases 
I = {1,4}, {2, 5}, {3, 6}, (1, 2, 3}, {1, 5, 6}, (3, 4, 5}, {2, 4, 6}. 


The first two calculations are inequalities of the compression I’ of a quasi-regular tetra- 
hedron. 


Calculation 9.1. I < 1 pt. 


This first inequality and Calculation 9.3 are the only ones that are not strict inequalities. 
Set So = S(2, 2,2, 2, 2, 2). By definition, [(So) = 1 pt. We must give a direct proof 
that So gives the maximum in an explicit neighborhood. Then we use the method of 
subdivision to bound I’ away from | pt outside the given neighborhood. An infinitesimal 
version of the following result is proved in [H1]. 


Lemma 9.1.1. Jf y; € [2, 2.06], fori = 1,...,6, then T(SQ1,.--,¥6)) < (So), 
with equality if and only if S(y,,.--, ¥6) = So. 


Set ago = a(2, 2, 2,2, 2,2) = 20, Ap = A(2?, 2”, 27, 27, 27, 27), t) = VW Ap/2 = 
4/2, and by = (1 + tii Gea) = 50/81. Set f = max;(y; — 2) < 0.06 anda~ = 
a(2,2,2,2+ f,2+ f,2+ f) =20- 12f —3f7?. As in Section 8, we set t = /A/2. 
We have arctan(x) < arctan(x9) + (x — x9)/( + Ks if x, x9 > 0. We verify below that 
A > Ao under the restrictions given above. Then 


3 
(9.1.2) TCS) = — "eat + 5 zona +} 


i=0 


rs va =) Dae »(4- “) 


IA 
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Socr(A — A ae ee: 
1( 0) b> cs! 


= [(So) -— 
00 TAG 4p) Ls a, 
_2 t boty < — aj)? 
boto “at pe le > (doo — ai) 
Ao i= 49 70 Gi 


IA 


(A = Ao) Soct boto 
T Si —~ Uy 
(So) + ars ( mA a) + ae Ye ai) 


3 


+ Fe Dao — a 

* aa =0 
Set co = —Soer/24 + bo/a~ > 0. Write y, = 2+ fj, withhO < fj < f. Setx; = 
4+ ¢; y? = 44 4f; + f?. Then f; < e;/4, fori = 1,...,6. Sete = 4f + f?. 
Set A(e),..., 6) = A(x, ..., x6). We find that 0?A/de2 de} = 44+ e4 +e5 —e5 > 0 
and similarly that 07A/de;de, > 0, fori = 3,5, 6. So JA/de, is minimized by taking 
€2 = €3 = €5 = &6 = 0, and this partial derivative is at least 


aA ‘ 
5,0 0, 0, e4, 0, 0) = 16 — 8e; — 2e;e4 — ef = 16 — 8e — 3e° > 0. 
e} 


Thus A > Ao. The partial derivative aA /de, is at most 
aA 5 ‘ 
ie e,e,e4,e,e) = 16+ l6e — 8e; + 4ee4 — 2e;e4 — ej < 164+ 16e + 4ee4 — 3 

el 


16 + 16e + 3e?. 


So A — Ap < (16+ 16e + 3e7)(e, +---+ 6). 

We expand + G —doo) = 5(ey +--+: +e6) +h, +h2+h3 as asum of homogeneous 
polynomials h; of degree i in f},..., fe. A calculation shows that h; = 0. Also hz is 
quadratic in each variable f; with negative leading coefficient, so h2 attains its minimum 
at an extreme point of the cube [0, f]°. A calculation then shows that hy > —14f? on 
[0, f]°. By discarding all the positive terms of /3, we find that 


hs3>—fifs-fiff -BA-hK-K fe — Af = -6f°. 


Thus, ye — ao) > 5(e) +--+ +6) — 1 i _ 6, Since e = max(e;), we find 
that f < e/4 < (e; +--- + e6)/4. This gives 


3 7 2 
Yau = a) < (e, +o 466) (~ s+ ty it), 
i=0 2 2 


IA 


Similarly, we expand > (ago — aj)* as a polynomial in f;,..., f6. To obtain an upper 
bound, we discard all the negative terms of the polynomial and evaluate all the positive 
terms at (f|,..., fo) = (f,..., f). This gives 
3 
Y-(aoo —wy < 4944 7° +7206 f* + 3684 f* + 828 f° +73" 
i=0 


IA 


T(er +--+ + e)(4944 f + 7296 f? + 3684 f? + 828 f* +73 f°). 
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We now insert these estimates back into inequality (9.1.2). This gives 


MS) — P'(So) co », , bolo 9 
9.1.3 < 16+ l6e + 3e*) + —-(-5 + 3.5f + 1.5 
( Sea oI e + 3e*) a f f°) 
boto 2 3 4 5 
+ Zp = (4944 f +7296 f? + 3684/3 + 828 f4 + 73%). 
00 


The right-hand side of this inequality is a rational function of f. (Both a~ and e depend 
on f.) Each of the three terms on the right-hand side is increasing in f. Therefore, the 
right-hand side reaches its maximum at f = 0.06. Direct evaluation at f = 0.06 gives 
(S$) — (So) < —0.00156(e; + --- + e6). 

To verify Calculation 9.14, the computer examined only 7 cells. Calculation 9.1 
required 1899 cells. Calculation 9.6.1 required over 2 million cells. The number of cells 
required in the verification of the other inequalities falls between these extremes. 


Calculation 9.2. [ < 0.5 pt, if y, € [2.2, 2.51]. 


The next several calculations are concerned with the relationship between the dihedral 
angle and the compression. 


Calculation 9.3. dih(S(y1,...,Y6)) > dibmin i= dih(S$(2, 2.51, 2,2,2.51,2)) ~ 
0.8639. 


Since this bound is realized by a simplex, we must carry out the appropriate local 
analysis in a neighborhood of $(2, 2.51, 2, 2, 2.51, 2). 


Lemma 9.3.1. Suppose that 2 < y; < 2.2, fori = 1,3, 6, and that 2 < y; = 2.51, for 
i = 2,4,5. Then dih(S(y1,..-, y6)) > ditmin. 


Proof. This is an application of Lemma 8.3.1. By that lemma, dih(S) is increasing in 
X4, SO we fix x4 = 4. The sign of 0 cos dih/dx is the sign of 0A /dx3, and a simple 
estimate based on the explicit formulas of Section 8 shows that 0A /dx3 > 0 under the 
given constraints. Thus, we minimize dih(S) by setting x2 = 2.517. By symmetry, we 
set x5 = 2.517. 

Now consider dih(S) as a function of x), x3, and x6. The sign of 0 cos dih/0dx3 is 
the sign of dA(x1, 2.517, x3, 27, 2.517, x6)/0x2. The maximum of this partial (about 
—2.39593) is attained when x1, x3, and x6 are as large as possible: x; = x3 = %6 = 227. 
So dih(S) is increasing in x3. We take x3 = 4, and by symmetry x6 = 4. 

We have 

d cos dih(S) 2t) (x1) 


Ox] U(X1, X2, X6)3/2u(X1, X3, X5)3/2” 


where f;(x,) © 464.622 — 1865.14x; + 326.954x? — 92.9817x? + Axt. An estimate 
of the derivative of t;(x) shows that t,(x) attains its maximum at x, = 4, and t,(4) < 
—6691 < 0. Thus dih(S) is minimized when x; = 4. 


Calculation 9.4. [ < 0.378979 dih — 0.410894. 
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Lemma 9.5. Jf S,, Sx, 83, and S4 are any four tetrahedra such that dih(S,) + dih(S2) + 
dih($3) + dih(S4) > 277, then T'(S;) +---+T(S4) < 0.33 pt. 


This lemma is a consequence of the following three calculations, each established by 
interval arithmetic and the method of subdivision. 


Calculation 9.5.1. 
Tl < —0.19145 dih + 0.2910494, 


provided dih € [dih(S(2, 2, 2, 2, 2, 2)), 1.42068]. 


Calculation 9.5.2. 
TI < —0.0965385 dih + 0.1562106, 


provided dih € [1.42068, dih(S(2, 2, 2, 2.51, 2, 2))]. 


Calculation 9.5.3. 
Tl < —0.19145 dih+ 0.31004, 


provided dih > dih(S(2, 2, 2, 2.51, 2, 2)). 


To deduce Lemma 9.5, we consider the piecewise linear bound ¢ on I obtained 
from these estimates. The linear pieces are €;(x) < 1 pt, €2, £3, and £4 on [dihmin, d1], 
[d,, dz], [d2, d3], and [d3, dihmax], where d; = dih(S(2, 2, 2, 2, 2, 2)),dzy = 1.42068, and 
d3 = dih(S(2, 2, 2, 2.51, 2, 2)). (See Calculation 9.1 and Lemma 8.3.2.) Diagram 9.5.4 
illustrates these linear bounds. (There are small discontinuities at d), d>, and d3 that 
may be eliminated by replacing ¢; by €; + ¢;, for some e; > 0, fori = 1, 2, and 
4.) We then ask for the maximum of ¢(t;) + €() + €(t3) + €(t4) under the constraint 
(ttt+h+h+t4)/4 => m/2 € [do, dz]. Since €(x) is constant on (dihmin, d)) and decreasing 
on [d1, dihmax], we may assume that t; > d,, for all i. Since the slope of £2 is equal to 
the slope of £4, we may assume that ¢; > do, for all i, or that 4; < d3, for alli. If t; < ds, 
for all 7, then we find that t; > 27 — 3d3 > do. So in either case, t; > do, for all i. By 
convexity, an upper bound is 423(7/2) < 0.33 pt. 


iT b,b, hl 
1 pt 


0 pt 
dod 


Diagram 9.5.4 
Lemma 9.6. Jf S,,..., Ss are any five tetrahedra such that dih(S,) +---+ dih(Ss) > 


21, then 
T(S,) +---+10(S5) < 4.52 pt. 


This is a consequence of two other calculations. 
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Calculation 9.6.1. [ < —0.207045 dih + 0.31023815, provided dih € [do, 27 —4dbo], 
where dp = dih(S(2, 2, 2, 2, 2, 2)). 


Calculation 9.6.2. TP < 0.028792018, if dih > 27 — 4db. 


Calculations 9.1, 9.6.1, and 9.6.2 give a piecewise linear bound ¢ on I as a function 
of dihedral angle. See Diagram 9.6.3. (Again, there are minute discontinuities that may 
be eliminated in the same manner as before.) We claim that 4.52 pt > €(t;) +---+ (ts) 
whenever ft; + ---+¢5 > 27. As in Lemma 9.5, we may assume that t; > do. Since £ 
is decreasing on [do, dihmax], we May assume that t; + ---+¢s; = 27. Thus, only the 
interval [do, 27 — 4do] is relevant for the optimization. On this interval, the bound is 
linear, so €(t;) + ---+ €(t5) < S5€(20/5) < 4.52 pt. 


1 pt 
I(x) 


do 
Diagram 9.6.3 
Calculation 9.7. T° < 0.389195 dih — 0.435643, if y) > 2.05. 
The next inequalities relate the solid angles to the compression. 
Calculation 9.8. I < —0.37642101 sol + 0.287389. 
Calculation 9.9. T < 0.446634 sol — 0.190249. 


Calculation 9.10. T < —0.419351 sol + 0.2856354 + 0.001, if y; € [2, 2.1], fori = 
Ds dp a5: 


The following calculation involves the circumradius. We leave it to the reader to check 
that the dimension-reduction techniques of Lemma 8.7.1 may still be applied. 


Calculation 9.11. T < —0.419351 sol+0.2856354, provided that y4, ys, ye > 2.1 
and that the circumradius of S is at most 1.41. 


Calculation 9.12. TI < —0.419351 sol + 0.2856354 — 5(0.001), if y; > 2.1 for some 
i, and yg € [2, 21]. 


Calculation 9.13. < —0.65557 sol+ 0.418, if y; € [2, 2.1], fori = 1,..., 6. 
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Calculation 9.14. sol(S) > 0.21. 


Calculation 9.15. I + (K — sol)/3 < 0.564978 dih — 0.614725, where K = 
(4a — 6.48)/12, provided y, € [2, 2.05], and yo, y3 € [2, 2.2]. 


Calculation 9.16. dih > 0.98 and sol > 0.45, provided y; € [2, 2.05], and yz, y3 € 
[2;.2.21. 


Let I'(S) be replaced by vor(S) in each of the Calculations 9.1—9.16 to obtain a new 
list of inequalities 9.1’-9.16’. (In 9.11’ we drop the constraint on the circumradius of S.) 
We claim that all of the inequalities 9.*’ hold whenever S is a quasi-regular tetrahedron of 
circumradius at least 1.41. In fact, inequalities 9.1’, 9.2’, 9.4’, 9.5.1’, 9.5.2’, 9.5.3’, 9.6.1’, 
9.6.2’, and 9.7’ follow directly from 9.17 and the inequalities dihmin < dih < dihmax. 
Calculations 9.3, 9.14, and 9.16 are independent of I’, and so do not require modification. 
Inequalities 9.8’, 9.11’, and 9.12’ also rely on 9.17, 9.18, and 9.19, inequality 9.9’ on 9.14, 
and inequality 9.15’ on 9.16. Inequalities 9.10’ and 9.13’ are vacuous by the comments 
of paragraph 8.2.4. 

Write S = S(y1, y2, y3, ¥4, ¥5, Yo), Vor = vor(S), and let rad = rad(S) be the 
circumradius of S. 


Lemma 9.17. /f the circumradius is at least 1.41, then vor < —1.8 pt. 


Proof. Tf sol > 0.91882, then the lemma is a consequence of Lemma 9.18. (The proof 
of Lemma 9.18, under the restriction sol > 0.91882, is independent of the proof of this 
lemma.) Assume that sol < 0.91882. 

If S’ and S and Delaunay stars, related as in Section 8.7, then vor(S’) > vor(S) 
because Be is obtained by slicing a slab from So. This means that we may apply the 
dimension-reduction techniques described at the beginning of this section, unless the 
deformation decreases the circumradius to 1.41. An interval calculation establishes the 
result when y; + y2 + y3 > 6.3 and the circumradius constraint is met (Calculation 
9.17.1). When the dimension-reduction techniques apply, the dimension of the search 
space may be reduced to four, and an interval calculation similar to the others in this 
section gives the result (Calculation 9.17.2). 

We lacked the computer resources to perform the interval analysis directly when the 
dimension-reduction techniques fail and found it necessary to break the problem up into 
smaller pieces when y; + y+2+ y3 < 6.3. We make use of the following calculations. 


Calculation 9.17.1. vor(S) < —1.8 pt provided y; + y2 + y3 > 6.3, rad($) = 1.41, 
and sol(S) < 0.91882. 


Calculation 9.17.2. vor(S) < —1.8 pt provided that rad($) > 1.41, yityo+y3 > 6.3, 
sol(S) < 0.912882, and S lies in one of the seven subspaces of smaller dimension 
associated with / (sol, I’) at the beginning of the section. 
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Calculation 9.17.3.1. Assume that y; + yo + y3 < 6.3 and that rad > 1.41. Then 
sol > 0.767. 


Calculation 9.17.3.2._ Assume that y; + yo + y3 < 6.192 and that rad > 1.41. Then 
sol > 0.83. 


Calculation 9.17.3.3. Assume that y; + y2 + y3 < 6.106 and that rad > 1.41. Then 
sol > 0.87. 


Calculation 9.17.3.4. Assume that y; + y2 + y3 < 6.064 and that rad > 1.41. Then 
sol > 0.9. 


Calculation 9.17.3.5. Assume that y; + y2 + y3 < 6.032 and that rad > 1.41. Then 
sol > 0.91882. 


In the interval arithmetic verification of Calculations 9.17.3, we may assume that 
rad = 1.41 and that y,; + y2 + y3 is equal to the given upper bound 6.3, 6.192, etc. To 
see this we note that the circumradius constraint is preserved by a deformation of S that 
increases y;, y2, or y3 while keeping fixed the spherical triangle on the unit sphere at 
the origin cut out by S. We increase y;, y2, and y3 in this way until the sum equals the 
given upper bound. Then fixing y;, y2, and y3, and one of y4, ys, and yo, we decrease 
the other two edges in such a way as to decrease the solid angle and circumradius until 
rad = 1.41. 

This deformation argument would break down if we encountered a configuration in 
which two of y4, ys, and yo equal 2, but this cannot happen when rad($) > 1.41 because 
this constraint on the edges would lead to the contradiction 


1.41 < rad(S) < rad(S(2.3, 2.3, 2.3, 2, 2, 2.51)) < 1.39 


It is possible to reduce Calculations 9.17.3 further to the four-dimensional situation 
where two of y4, ys and y¢ are either 2 or 2.51. Consider a simplex S with vertices 0, v;, 
v2, and v3. Let p; be the corresponding vertices of the spherical triangle cut out by S on 
the unit sphere at the origin. Fix the origin, v2, and v3, and vary the vertex v;. The locus 
on the unit sphere described as p, traces out spherical triangles of fixed area is an arc of a 
Lexell circle C. Define the “interior” of C to be the points on the side of C corresponding 
to spherical triangles of smaller area. The locus traced by v; on the circumsphere (of S) 
with |v;| constant is a circle. Let C’, also a circle, be the radial projection of this locus 
to the unit sphere. Define the “interior” of C’ to be the points coming from larger |v;|. 
The two circles C and C’ meet at p,, either tangentially or transversely. The interior of C 
cannot be contained in the interior of C’ because the Lexell arc contains p3, the point on 
the unit sphere antipodal to p2 [FT, p. 23], but the interior of of C’ does not. Furthermore, 
if |v;| = 2 and |v2| + |v3| > 4, then v2 or v3 lies in the interior of C and C’, so that 
the circles have interior points in common. This means that v; can always be moved in 
such a way that the solid angle is decreasing, the circumradius is constant, and the length 
|v;| is decreasing (or constant if |v;| = 2). If any two of ya, ys, and ye are not at an 
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extreme point, this argument can be applied to v1, v2, or v3 to decrease the solid angle. 
This proves the reduction. 

We are now ina position to prove the lemma for simplices satisfying yj; +yo+y3 < 6.3. 
Calculation 9.17.3.1 allows us to assume sol > 0.767. As in Section 8.6, let S, = 
S(2, 2,2, y, y, y). Werely on the fact that vor(S,,) is decreasing in y, for y € [2.26, 2.41]. 
In fact, the results of Section 8.6 specialize to the formula 


88 pcr¥” 
(2 = 97) (16 = 97) 


_ y2)1/22 
(9.17.4)  vor(Sy) = a) ; 


+ 3 arctan ( eee 6y2 
and the sign of its derivative is determined by a routine Mathematica calculation. It is clear 
that sol(S,,) is continuous and increasing in y. Since sol( S26) < 0.767 and sol(Sz4)) > 
0.91882, our conditions imply that sol(.$) = sol(S,) for some y € [2.26, 2.41]. Let r(a) 
and vol(R(a, b, c)) be the functions introduced in Section 8.6.4. 

This suggests the following procedure. Pick y so that sol(S) > sol(S,). Calculate the 
smallest (or at least a reasonably small) a for which 


&(y, a) = vor(S\) — 48ocr (: = =) 2r(a) 


is less than —1.8 pt. Monotonicity (Calculation 9.20.1) and Lemma 8.6.5 imply that 
vor(S) < —1.8 pt, if yityo2+y3 > 2(2+a). To treat the case that remains (y;+ yo2+y3 < 
2(2 + a)), use Calculations 9.17.3 to obtain a new lower bound on sol($), and hence 
a new value for y. The procedure is repeated until a = 0.016. Calculation 9.17.3.5 
completes the argument by covering the case y; + y2 + y3 < 6.032. We leave it to the 
reader to check that 


& (2.2626, 1.096), &(2.326, 1.053), &(2.364, 1.032), €(2.391, 1.016) 
are less than —1.8 pt and that 
Sol(S3.2626) < 0.767, sol(S2.326) < 0.83, sol(Sz.364) < 0.87, sol(S2 391) < 0.9. 


This completes the proof of Lemma 9.17. O 
Lemma 9.18. /frad(S) > 1.41, then vor < —0.419351 sol + 0.2856354. 


Proof. We adopt the notation and techniques of Lemma 9.17. If sol < 0.918819, 
then the result follows from Lemma 9.17. (The proof of Lemma 9.17 is indepen- 
dent of the argument that follows under that restriction on solid angles.) Let f(S) = 
—0.419351 sol(S) + 0.2856354 — vor(S). We show that f(S) is positive. We use the 
inequality f(S) > f (Stan) + 48ocer vol(S Sean) of Lemma 8.6.4. A routine calculation 
based on formula (9.17.4) shows that f(S,) is increasing, for y € [2.4085, 2.51]. 

If sol > 0.951385, then we appeal to Fejes Téth’s convexity argument described in 
Section 8.6. To justify the use of his argument, we must verify that the cone over Stan 
contains the circumcenter of S. The first, second, and third edges have length 2, and the 
fourth, fifth, and sixth edges are between 2.21 and 2.51 (Calculation 9.18.2). These are 


425 


46 T. C. Hales 


stronger restrictions on the edges than in the proof of Lemma 8.6.5, so the justification 
there applies here as well. We observe that sol($2.4366) < 0.951385 so that 


F(S) = f(Stan) = f (Sy) = f(S2.4366) © 0.000024 > 0, 


where y satisfies sol(S,) = sol( Stan). We may assume that 0.918819 < sol < 0.951385. 


Lemma 9.18.1. The combined volume of the two Rogers simplices along a common 
edge of a quasi-regular tetrahedron S is at least 0.01. 


Proof. The combined volume is at least that of the right-circular cone of height a and 
base a wedge of radius Vb? — a? and dihedral angle dihmin, where a is the half-length 
of an edge and 5 is a lower bound on the circumradius of a face with an edge 2a. This 
gives the lower bound of 


(b> —a’)a 


dibmin 2 2 
> 0.1439(b* — a“). 
ms 


We minimize b by setting b? = n(2,2,2a)? = 4/(4 - a’). Then b? — a? = 


(a? — 2)7/(4 — a’), which is decreasing ina € [1, 2.51/2]. Thus, we obtain a lower 
bound on b* — a? by setting a = 2.51/2, and this gives the estimate of the lemma. 0 


We remark that sol($3.4985) < 0.918819. If 1.15 < (y; + 92+ y3 —4)/2, then Lemma 
9.18.1 and Section 8.6 give 


1 
FS) = fF (S2.4085) + 48oer (1 - a) 0.01 > 0. 


(Analytic continuation is not required here, because of the constraints on the edges in 
Calculation 9.18.2.) We may now assume that y; + y2 + y3 < 6.3. To continue, we need 
a few more calculations. 


Calculation 9.18.2. If sol > 0.918, then y4, y5, yo > 2.21. 

Calculation 9.18.3.1. If y; + yo + y3 < 6.02 and rad > 1.41, then sol > 0.928. 
Calculation 9.18.3.2. If y; + yo + y3 < 6.0084 and rad > 1.41, then sol > 0.933. 
Calculation 9.18.3.3. If y; + y2 + y3 < 6.00644 and rad > 1.41, then sol > 0.942. 


In the verification of these calculations, we make the same reductions as in Calcula- 
tions 9.17.3. 

Adapting the procedure of Lemmas 9.17 and 8.6.5, we fine that a lower bound on the 
solid angle leads to an estimate of a constant a with the property that f(S) > 0 whenever 
yity2+y3 => 44+2a. That is, we pick a so that &)(y, a) := f(Sy)+88oe¢(U — 1/a*)r(a) 
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is positive, where y is chosen so that sol(S,) is a lower bound on the solid angle. The 
values 


&1 (2.4085, 1.01), (2.4165, 1.0042), &) (2.42086, 1.00322), &) (2.4286, 1.0017) 


are all positive. This yields the bound y; + y2 + y3 < 6.0034. 

Assume that S satisfies y,) + y2 + y3 < 6.0034 and rad($S) > 1.41. Then by Cal- 
culation 9.18.3.3, sol($) = sol(Stan) > 0.942. Also rad(Stan) > rad($)/1.0017 > 
1.41/1.0017 > 1.4076, because rescaling Stan by a factor of 1.0017 gives a simplex 
containing S. This means that Stan Satisfies the hypotheses of the following calculation. 
Calculation 9.18.4 completes the proof of Lemma 9.18. 


Calculation 9.18.4. If rad(Sta => 1.4076 and 0.933 < sol(Stan) < 0.951385, then 
f (Stan) > 0. 


In this verification we may assume that the fourth, fifth, and sixth edges of Stan are at 
least 2.27, for otherwise the circumradius is at most 


rad(S(2, 2, 2, 2.27, 2.51, 2.51)) < 1.4076. 


In the verification of Calculation 9.18.4, we also rely on the fact that 


Fil%a, %5, X56) = f(S(2, 2, 2, ./%4, «/%5, »/%6)) 


is increasing in (x4, X%5,X6) € [2.277, 2.517]. Here is a sketch justifying this fact. The 
details were carried out in Mathematica with high-precision arithmetic. The explicit 
formulas of Section 8.6 lead to an expression for df; /0x4 as 


W (x4, X5, X6)(—X4 + x5 + X6) 
(—16 + x4)2A3/? 


> 


where W is a polynomial in x4, x5, and x6 (with 13 terms). To show that W is positive, 
expand it in a Taylor polynomial about (x4, x5, x6) = (2.277, 2.27%, 2.277) and check 
that the lower bound of inequality (7.1) is positive. 


Calculation 9.19. If y4 € [2, 2.1], then sol < 0.906. 


For a < b < c, we have vol(R(a, b, c)) = a((b? — a*)(c? — b*))'/?/6. The final 
four calculations are particularly simple (to the extent that any of these calculations are 
simple), since they involve a single variable. They were verified in Mathematica with 
rational arithmetic. 


Calculation 9.20.1. The function (1 — 1/a*) vol(R(a, n(2, 2, 2a), 1.41)) is increasing 
on [1, 1.15]. 


Calculation 9.20.2. The function vol(R(a, n(2, 2, 2a), 1.41)) is decreasing for a € 
[1, 1.15]. 
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Calculation 9.20.3. Fora € [1, 1.15] we have 


vol(R(a, n(2, 2, 2a), 1.41)) < vol(R(a, (2, 2.51, 2a), 1.41)). 


Calculation 9.21. Let p; = pi(y1,..., ye) be the point in Euclidean space introduced 
in Section 8.6.7. For y € [2.3, 2.51], 


d y? 1/2 a : y? 1/2 
— ( (2.51, 2.51, y) — — —0.75 > — , 2.51, 2, 2, 2, 2.51)|?7 — — : 
rm (n y) 7) > > a (ino I 7) 
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Appendix. Proof of Theorem 6.1 
D. J. Muder 


Notation and Observations. Let P be a point of degree d. If we consider P to be the 
center of the configuration, then the first rim of points will be the set of d points adjacent 
to P, and the second rim will be those at distance 2 from P. Let 6(P) be the sum of the 
degrees of the first rim points. If d = 6, it is easy to see that the number of points on the 
second rim is s = 5(P) — 24, and the total number of points of distance at most 2 from 
P is 5(P) — 17. The second rim is thus an s-gon. This s-gon and the triangulation of 
the P-side of the s-gon are referred to as the inner graph. The inner-graph degree of a 
second rim point is called its inner degree. The number of second-rim points with inner 
degree 5 is equal to the number of degree 4 points in the first rim, and the number of 
second-rim points with inner degree 3 is equal to the number of degree 6 points on the 
first rim. All other second-rim points have inner degree 4. Points beyond the second rim 
are called extra points. If there are no extra points, then there are at least two second-rim 
points whose degrees are equal to their inner degrees. Let Ni, (P) be the number of points 
of degree A in rimi around P. Notice that if d = 6, then 


5(P) = 30 — Nj4(P) + Ne(P). 


Further, Euler’s formula gives N — 12 = No — N4. Putting this together with 5.1.1 and 
5.1.5, we see 


13 < 12+ Neo —N4 < 15, 
1<14+Ng<No<34+N4 <5. 


Lemma 1. There is no triangle whose vertices all have degree 6. 
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Lemma 2. Suppose No > 3. Let Ge be the subgraph of points of degree 6. Either Ge 
is three points in two components, or it is a single (open or closed) path, with no other 
edges. 


With these lemmas and this notation, we consider the different possible values of Ne. 


No = 1. This forces N4 = 0. Let P be the degree 6 point. Then 5(P) = 30, all thirteen 
points are in the inner graph, and all six second-rim points have inner degree 4. However, 
at least two of these second-rim points have inner degree equal to degree. Contradiction. 


No = 2. If the two degree 6 points are adjacent, let P be either one. Now 5(P) = 31—Nj, 
and we see that there are no extra points, and all second-rim points have degree 5. All 
but Nj second-rim points have inner degree less than 5, and at least two of them will 
have inner degree equal to degree. So N4 > N, i = 2, and N < 12. Contradiction. 

If the degree 6 points are not adjacent, then the (6, 6) forces N > 14, so Ny = 0. If 
6(P) is either of the two degree 6 points, 6(P) = 30, so the second rim has six points, 
all of inner degree 4, and there is one extra point. If the extra point has degree 5, it is 
adjacent to all but one of the second-rim points. This unique second-rim point is then 
part of a quadrilateral, which can only be triangulated by the diagonal edge that does not 
include the extra point. This creates two second-rim points of degree 6, in violation of 
our assumptions. Therefore the extra point has degree 6, and we have Diagram 6.2. 


No = 3. Either Gg is an open path or it has two components. In either case there is a 
(6, 6),so N > 14and Ng = Oor 1. 

In the first case let P be the center of the path. Now 6(P) = 32— Nj. We see that there 
are no extra points and any degree 4 point is in the first rim. The second rim has at most 
one point of inner degree greater than 4, and two points whose inner degrees are equal 
to their degrees. So there must be a point of degree 4 on the second rim. Contradiction. 

In the second case 5.1.8 forces N = 15, hence Ny = 0. If P is one of the points in 
the two-point component of Gg, then 5(P) = 31, so there is one extra point and seven 
second-rim points. Either one or two of the second rim points is not connected to the 
extra point. In either case at least one of these two has degree = inner degree, which is 
at most 4. Contradiction. 


No = 4. Now Gg is either an open or closed path of length 4. If open, then, by 5.1.8, 
N = 15 and N, = 1. If P is either of the interior degree 6 points, then 6(P) = 32 — M4. 
the second rim is an (8 — Nj)-gon with Nj points of inner degree at least 5 and Nj extra 
points. We cannot have WN. i = 0, or else there would be at least two second-rim points 
of degree 4, so Nj = 1. This holds no matter which interior degree 6 point we started 
with, so the degree 4 point must be adjacent to both. It must also be adjacent to one of 
the other two, or else there is a (6, 6, 4). Therefore there is a degree 6 point whose first- 
rim degree sequence is 6/4/6/5/5/5, producing a second-rim inner degree sequence of 
4/3/5/3/4/4/4. The point of inner degree 5 is the fourth neighbor of the degree 4 point, 
and cannot be degree 6 without making G¢ closed. Therefore there is an edge connecting 
the two points of inner degree 3. Adding this edge to the inner graph creates a hexagon 
with all points of “inner” degree 4, and the extra point in its interior. The extra point is 
connected to five of these points, and the sixth has degree 4. Contradiction. 
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If Ge is a closed path, then a degree 4 point Q must triangulate this quadrilateral. 
Using Q as the center, there are four first-rim points and eight second-rim points. The 
second rim can be drawn in a square with the four points of inner degree 3 on the corners 
and the four points of inner degree 4 at the midpoints. There is at most one additional 
degree 4 point (other than Q). If it is anywhere but at one of the second-rim midpoints, 
a (6, 6, 4) exists. The four midpoints cannot be adjacent to any additional second-rim 
points without creating an illegal triangle or quadrilateral. Since there can be at most one 
more degree 4 point, at least three of the four second-rim midpoints must be joined to 
the extra point(s). Two midpoints of consecutive sides cannot be joined to the same extra 
point without forcing the corresponding corner to be degree 4, which is impossible. So 
there must be two extra points, and thus no degree 4 points other than Q. One extra point 
must be joined to the midpoints of each pair of parallel sides of the square. However, all 
these edges cannot be drawn without intersecting. 


No = 5. Now Gg is a path of length 5, closed so that a (6, 6, 6) does not exist. Any 
point in this pentagon is nonadjacent to two others, so the situation of 5.1.8 applies in 
five different ways. Let P, Q, R € Ge with P and Q adjacent, but neither adjacent to 
R. Then there are eight points adjacent to either P or Q, and six points adjacent to R, 
so there are exactly two common points in these two sets. In our case these two points 
are precisely the other two points of degree 6. This means that no point inside or outside 
the pentagon can be connected to more than two points of the pentagon. Thus the only 
possible configuration for the 15 points is to form three concentric pentagons, with Ge 
as the middle one. Prior to triangulating the inner and outer pentagons, all of their points 
have degree 4. Triangulating either creates more degree 6 points. Contradiction. 


Proof of Lemma 1. Let P,, Py, and P3 be the vertices of such a triangle. If P; has no 
neighbors of degree 4, then the only possibility is (P;) = 32, which forces N; = 4, 
Ng = 2, and N = 15. All second-rim points have inner degree at most 4, and there are 
no extra points. Therefore two second-rim points have degree and inner degree 4. Call 
them Q, and Q>. Therefore, N4 > 2, Ng > 5, and Ne > 2. Let R; and R> be second-rim 
points of degree 6. Each Q, and Ry must be adjacent, or else (P;, Re, Q,) is a (6, 6, 4). 
However, all edges from Q; are inner-graph edges. So Q; Ri Q2 Ro must be a second-rim 
quadrilateral, which is impossible. Thus Nj (P;) > 1 for each P;. 

This forces N4 = 2. At least one of the degree 4 points is adjacent to two of the 
P;. One of these two, say P;, is not adjacent to the other degree 4 point. Since 6(P) < 
32, we are left with four possible first-rim degree sequences for P;: (1) 6/6/4/5/5/5; 
(2) 6/6/4/6/5/5; (3) 6/6/4/5/6/5: or (4) 6/6/4/5/5/6. 

The last three possibilities are easily dealt with. Sequence (3) contains a (6, 6, 4). 
Sequence (4) creates two 6-6-6 triangles joined at an edge. Let P; and P2 be the vertices 
of the edge, and let P; and Py, be the other vertices of the triangles. The conditions 
Nj (P;) = 1 and N4 = 2 force P3 and Py, to have sequence (1). In (2) we can invoke 
5.1.8 with the three degree 6 points in the first rim. Call them R,, Ro, and 7, with R, 
adjacent to Ro. In order for the numbers to work out, there can be precisely two points 
which are adjacent to T as well as one of the R;. However, in (2) the center, the first-rim 
degree 4 point, and the second-rim neighbor of the first-rim degree 4 point must all fit 
this description. 


430 


Sphere Packings, I 31 


Sequence (1) is more difficult to eliminate. Let R,,..., Re be the first-rim points 
listed in the order of (1). Let S$; ...$7 be the second-rim points, with S; being adjacent 
to both R; and R2, and S7 having only R; as a first-rim neighbor. There is at least one 
degree 6 point in the second rim—f it is at S,, then 6(R,) = 32; if it is anywhere else 
we can invoke 5.1.8. In either case, N = 15 and there are two degree 6 points outside the 
first rim. These must lie in {S¢, $7, S,, S3} to avoid making a (6, 6, 4) with R; and R3. 
At most one of these points can be adjacent to R; (since 6(R;) < 33), so $3 must have 
degree 6. Now $3 must connect to both the degree 6 and degree 4 points in {S¢, 57, S;} to 
avoid making a (6, 6, 6) or (6, 6, 4) with the center point. However, 53 has inner degree 
5 and so can connect to at most one of those points. Contradiction. 


Proof of Lemma 2. If Ge has at least three points and no triangles, then there exists a 
(6, 6) forcing N > 14. Any point in Gg can be adjacent to at most two others, or else ei- 
ther a triangle or a (6, 6, 6) is created. Therefore each component is a path. There cannot 
be three or more components without producing a (6, 6, 6). If Gg has two components, 
each of them must be a complete graph for the same reason. Since there are no trian- 
gles, both of the components must have at most two points. If there are two components 
with two points each, then 5.1.8 forces N = 15. Each component is adjacent to eight 
non-Gg points of the original graph, and not adjacent to three. Now 5.1.8 forces each 
point in the other component to be adjacent to these three. However, two points can have 
only two common neighbors. So the only two-component Gg is the one described in the 
lemma. 
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Sphere Packings, II* 


T. C. Hales 


Department of Mathematics, University of Michigan, 
Ann Arbor, MI 48109, USA 


Abstract. An earlier paper describes a program to prove the Kepler conjecture on sphere 
packings. This paper carries out the second step of that program. A sphere packing leads to 
a decomposition of R? into polyhedra. The polyhedra are divided into two classes. The first 
class of polyhedra, called quasi-regular tetrahedra, have density at most that of a regular 
tetrahedron. The polyhedra in the remaining class have density at most that of a regular 
octahedron (about 0.7209). 


1. Introduction 


This paper is a continuation of the first part of this series [4]. The terminology and 
notation of this paper are consistent with this earlier paper, and we refer to results from 
that paper by prefixing the relevant section numbers with “I.” 

We review some definitions from [4]. Begin with a packing of nonoverlapping spheres 
of radius | in Euclidean three-space. The density of a packing is defined in [2]. Itis defined 
as a limit of the ratio of the volume of the unit balls in a large region of space to the volume 
of the large region. The density of the packing may be improved by adding spheres until 
there is no further room to do so. The resulting packing is said to be saturated. 

Every saturated packing gives rise to a decomposition of space into simplices called 
the Delaunay decomposition [8]. The vertices of each Delaunay simplex are centers of 
spheres of the packing. By the definition of the decomposition, none of the centers of the 
spheres of the packing lie in the interior of the circumscribing sphere of any Delaunay 
simplex. We refer to the centers of the packing as vertices. Vertices that come within 
2.51 of each other are called close neighbors. 

The Delaunay decomposition is dual to the well-known Voronoi decomposition. If the 
vertices of the Delaunay simplices are in nondegenerate position, two vertices are joined 
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by an edge exactly when the two corresponding Voronoi cells share a face, three vertices 
form a face exactly when the three Voronoi cells share an edge, and four vertices form a 
simplex exactly when the four corresponding Voronoi cells share a vertex. In other words, 
two vertices are joined by an edge if they lie on a sphere that does not contain any other of 
the vertices, and so forth (again assuming the vertices to be in nondegenerate position). 

We say that the convex hull of four vertices is a qguasi-regular tetrahedron (or simply 
a tetrahedron) if all four vertices are close neighbors of one another. If the largest 
circumradius of the faces of a Delaunay simplex is at most 2, we say that the simplex 
is small. Suppose that we have a configuration of six vertices in bijection with the 
vertices of an octahedron with the property that two vertices are close neighbors if and 
only if the corresponding vertices of the octahedron are adjacent. Suppose further that 
there is a unique diagonal of length at most 2./2. In this case we call the convex hull 
of the six vertices a quasi-regular octahedron (or simply an octahedron). A Delaunay 
star is defined as the collection of all quasi-regular tetrahedra, octahedra, and Delaunay 
simplices that share a common vertex v. 

We assume that every simplex S in this paper comes with a fixed order on its edges, 
1,..., 6. The order on the edges is to be arranged so that the first, second, and third edges 
meet at a vertex. We may also assume that the edges numbered i and i + 3 are opposite 
edges fori = 1, 2,3. We define S(y;,..., ye) to be the (ordered) simplex whose ith 
edge has length y;. If S is a Delaunay simplex in a fixed Delaunay star, then it has a 
distinguished vertex, the vertex common to all simplices in the star. In this situation, we 
assume that the edges are numbered so that the first, second, and third edges meet at the 
distinguished vertex. 

A function, known as the compression T'(S), is define on the space of all Delaunay 
simplices. Set Soc. = (—32 + 12 arccos(1//3))/V8 © 0.720903. Let S be a Delaunay 
simplex. Let B be the union of four unit balls placed at each of the vertices of S. Define 
the compression as 


T'S) = —dger Vol(S) + vol(S NM B). 


We extend the definition of compression to Delaunay stars D* by setting [(D*) = 
> T(S), with the sum running over all the Delaunay simplices in the star. We define 
a point (abbreviated pt) to be '(S(2, 2, 2, 2, 2, 2)) © 0.0553736. The compression is 
often expressed as a multiple of pt. 

There are several other functions of a Delaunay simplex that will be used. The dihedral 
angle dih(S) is defined to be the dihedral angle of the simplex S along the first edge (with 
respect to the fixed order on the edges of S). The solid angle (measured in steradians) 
at the vertex joining the first, second, and third edges is denoted sol(S). Let rad(S) be 
the circumradius of the simplex S. More generally, let rad(F’) denote the circumradius 
of the face of a simplex. Let n(a, b, c) denote the circumradius of a triangle with edges 
a, b, c. Explicit formulas for all these functions appear in Section I.8. 

Fix a Delaunay star D* about a vertex vg, which we take to be the origin, and we 
consider the unit sphere at vo. Let v; and v2 be vertices of D* such that vp, v;, and v2 
are all close neighbors of one another. We take the radial projections p; of v; to the 
unit sphere with center at the origin and connect the points p; and p2 by a geodesic arc 
on the sphere. We mark all such arcs on the unit sphere. The closures of the connected 
components of the complement of these arcs are regions on the unit sphere, called the 
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standard regions. We may remove the arcs that do not bound one of the regions. The 
resulting system of edges and regions will be referred to as the standard decomposition 
of the unit sphere. 

Let C be the cone with vertex vp over one of the standard regions. The collection 
of the Delaunay simplices, quasi-regular tetrahedra, and quasi-regular octahedra of D* 
in C (together with the distinguished vertex vo) will be called a standard cluster. Each 
Delaunay simplex in D* belongs to a unique standard cluster. 

A real number, called the score, will be attached to each cluster. Each star receives a 
score by summing the scores for the clusters in the star. 

The steps of the Kepler conjecture, as outlined in Part I, are: 


1. A proof that even if all standard regions are triangular, the total score is less than 
8 pt. 

2. A proof that the standard clusters with more than three sides score at most 0 pt. 

3. A proof that if all of the standard regions are triangles of quadrilaterals, then the 
total score is less than 8 pt (excluding the case of pentagonal prisms). 

4. A proof that if some standard region has more than four sides, then the star scores 
less than 8 pt. 

5. A proof that pentagonal prisms score less than 8 pt. 


The proof of the first step is complete. The other steps are briefly discussed in 
Part I. This paper establishes Step 2. Partial results have been obtained for Step 3 [5]. 
C. A. Rogers has shown that the density of a regular tetrahedron is a bound on the density 
of packings in R? [8]. The main result of this paper may be interpreted as saying that 
the density (Soc, * 0.7209) of a regular octahedron is a bound on the density of the 
complement in R? of the quasi-regular tetrahedra in the packing. 

The score of a Delaunay star is obtained by mixing Delaunay stars with the dual 
Voronoi cells. Delaunay stars D* and the associated function T behave much better than 
estimates of density by Voronoi cells, provided each Delaunay simplex in the Delaunay 
star has a small circumradius. Unfortunately, (S$) gives an increasingly poor bound 
on the density as the circumradius of the Delaunay simplex S increases. When the 
circumradius of S is greater than about 1.8, it becomes extremely difficult to prove 
anything about sphere packings with the function ['(S). The score is introduced to 
regularize the irregular behavior of I"(S). 

Voronoi cells also present enormous difficulties. The dodecahedron shows that a 
single Voronoi cell cannot lead to a bound on the density of packings better than about 
0.75. This led L. Fejes Toth to propose an approach to the Kepler conjecture in which 
two layers of Voronoi cells are considered: one central Voronoi cell and a number of 
surrounding ones. Wu-Yi Hsiang has made some progress in this direction, but there 
remain many technical difficulties [3], [7]. 

The method of scoring in this paper seeks to combine the best aspects of both ap- 
proaches. When the circumradius of a simplex is small, we proceed as in Part I. However, 
when the circumradius of a simplex is large, we switch to Voronoi cells. Remarkably, 
these two approaches may be coherently combined to give a meaningful score to De- 
launay stars and, by extension, a bound on the density of a packing. The calculations of 
this paper suggest that this hybrid approach to packings retains the best features of both 
methods with no (foreseeable) negative consequences. 
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2. Some Polyhedra 


Sometimes the tip of a Voronoi cell protrudes beyond the face of a corresponding De- 
launay simplex (see Diagram 2.1(a)). This section describes a construction that amounts 
to slicing off the protruding tip of a Voronoi cell and reapportioning it among the neigh- 
boring cells (see Diagram 2.1(b)). 


Diagram 2.1. (a) Voronoi cells and (b) reapportioned. 


Let D* be a Delaunay star with center vp = 0. Let V be the Voronoi cell around 
Vo, obtained by duality from D*. As a matter of convenience, we may assume that each 
point in R? belongs to a unique Voronoi cell by making an arbitrary choice for each 
point on the boundary of a cell. If R is a standard cluster (possibly a single quasi-regular 
tetrahedron) in D*, let C(R) denote its cone over vp: 


C(R) = {tx :t > 0,x € R}. 


In general, VM C(R) depends on more vertices than just those in the cluster R. It 
is convenient to consider the slightly larger polyhedron Ve defined by just the vertices 
of D* that are in R. That is, let Ve be the intersection of C(R) with the half-spaces 
{x 1x -+U; < vu; -u;/2,Vi ~ O}, where {v;}; are the vertices (other than vo) of the 
simplices and quasi-regular solids in the cluster R. The faces of ve at Up are contained 
in the triangular faces bounding the standard region of R. The other faces of ve are 
contained in planes through the faces of the Voronoi cell V. We refer to these as Voronoi 
faces. If R is not a quasi-regular tetrahedron, set Ve = ve. If R is a quasi-regular 
tetrahedron, we take the slightly smaller polyhedron Vr obtained by intersecting Ve 
with the half-space (containing v9) bounded by the hyperplane through the face of R 
opposite the origin vp. (This may cut a tip from the Voronoi cell.) By construction, Vp 
depends only on the simplices in R. The polyhedron Vz is based at the center of some 
Delaunay star, giving it a distinguished vertex v. We write Ve = Vr(v) when we wish 
to make this dependence explicit. 

By construction, Ve > VNOC(R). Itis often true that Ve = V1 C(R). Let us study 
the conditions under which this can fail. We say that a vertex w clips a standard cluster R 
(based at v1) if w ~ v; and some point of Ve (v;) belongs to the Voronoi cell at w. Part I 
makes a thorough investigation of the geometry when a vertex w clips a quasi-regular 
tetrahedron. (The vertex w must belong to a second quasi-regular tetrahedron that shares 
a face with §. The shared face must have circumradius greater than “f2, and so forth.) 
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Lemma 2.2. Let R, based at a vertex vo, be a standard cluster other than a quasi- 
regular tetrahedron. Suppose it is clipped by a vertex w. Then there is a face (vo, V1, V2) 
of R such that (w, vo, V1, V2) is a quasi-regular tetrahedron. Furthermore, (vo, V,, V2) 
is the unique face of the quasi-regular tetrahedron of circumradius at least /2. 


Proof. Consider a point p in Vr\V. Then there exists a vertex w ¢ C(R) of D* such 
that p-w > w- w/2. The line segment from p to w intersects the cone C(F’) of some 
triangular face F that bounds the standard region of R and has vo as a vertex. Let v; 
and v2 be the other vertices of F. By the construction of the faces bounding a standard 
region, the edges of F have lengths between 2 and 2.51. 

Consider the region X containing p and bounded by the planes H; = span(v,, w), 
Ay = span(v2, w), 43 = span(vi, v2), Hy = {x : x- vy = vy, - Vv, /2}, and Hs = {x : 
X + U2 = V2- V2/2}. The planes Hy and Hs contain the faces of the Voronoi cell at up 
defined by the vertices v; and v2. The plane H3 contains the face F. The planes Hy 
and H> bound the region containing points, such as p, that can be connected to w by a 
segment that passes through C(F). 

Let P = {x :x-w > w-w/2}. The choice of w implies that X 1 P is nonempty. We 
leave it as an exercise to check that X M P is bounded. If the intersection of a bounded 
polyhedron with a half-space is nonempty, then some vertex of the polyhedron lies in 
the half-space. So some vertex of X lies in P. 

We claim that the vertex of X lying in P cannot lie on H;. To see this, pick coordinates 
(x1, x2) on the plane Hy with origin vp = O so that v; = (0, z) (with z > 0) and 
XNA, C X!' := {(%,%2) : xy = 0.x. < 2/2}. See Diagram 2.3. If X’ meets P, 
then the point v;/2 lies in P. This is impossible, because every point between vo and 
v, lies in the Voronoi cell at vo or v,, and not in the Voronoi cell of w. (Recall that 
|vy — vo| < 2.51 < 2/2.) 


Diagram 2.3 


Similarly, the vertex of X in P cannot lie on H2. Thus, the vertex must be the unique 
vertex of X that is not on Hj; or Hy, namely, the point of intersection of H3, H4, and 
Hs. This point is the circumcenter c of the face F. We conclude that the polyhedron 
Xo := XM P contains c. Since c € Xo, the hypotheses of Lemma 1.3.4 are met for 
T = F, and the vertices vo, v1, v2, and w are the vertices of a quasi-regular tetrahedron 
S. By Lemma 1.3.4, |w’ — v;| < 2.3, fori = 0, 1, 2. The circumradius of the face F is 
between /2 and 2.51//3 © 1.449. 
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In the same context, if w and w’ both clip R, then the regions they cut from Vr(uo) are 
disjoint. For otherwise, acommon point would belong to both ve (w) and Ve (w’), where 
S and S’ are the two quasi-regular tetrahedra constructed by the lemma. Section I.3 shows 
that S and S’ share their unique face of circumradius greater than s/2. This is impossible, 
because the lemma states that this face is shared with R. 

Although the polyhedron Xo belongs to the Voronoi cell at w, it is included in the 
polyhedron Vp. Similarly, by repeating the construction at v; and v2, we find that there are 
small regions X,, X> (with vertex c) in polyhedra Vr, and Vp, at v; and vo, respectively, 
that belong to the Voronoi cell at w. 

Call the union Xo U X, U X>2 the fip protruding form the quasi-regular tetrahedron 
S. Associated with a quasi-regular tetrahedron is at most one such tip. (The tip must 
protrude from the face of S with circumradius greater than /2.) By construction, the tip 
is the set of points 


{x : |x —w| < |x —v,|, fori = 0, 1, 2; det(x, v1, v2) det(w, v1, v2) < O}. 


This is V°(w)\Vs(w). 

The tip is a subset of the Voronoi cell at w. Section I.3 explains the conditions under 
which this can fail to hold. There must be another vertex u ~# w with the property that 
ju — v;| < 2.3, fori = 0,1,2. Then u, vo, vj, and v2 are the vertices of a second 
quasi-regular tetrahedron S’ with face F’, and this contrary to our assumption that R is 
not a quasi-regular tetrahedron. 


Corollary 2.4. The polyhedra Vr cover R? evenly as we range over all the standard 
clusters of all the Delaunay stars of the packing. 


Proof. The preceding analysis shows that the polyhedra Veg are obtained from the 
Voronoi cells by taking each protruding tip, breaking it into three pieces Xo, X1, X2, and 
attaching the piece X; to the Voronoi cell at vj. The Voronoi cells cover R? evenly. As a 
result of this analysis, we see that the polyhedra Vr cover R? evenly. 


To give one example of the size of the tip, we consider the extreme case of the tetra- 
hedron S = S(2, 2, 2, 2.51, 2.51, 2.51). Diagram 2.5 shows a correctly scaled drawing 
of a tip protruding from the largest face of S. 


Diagram 2.5 
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3. The Score Attached to a Delaunay Star 


This section gives some rules for computing the score. They were developed as a result 
of computer experimentation suggesting when it is advantageous to use Voronoi cells 
over Delaunay simplices. This section actually gives an entire family of scoring systems. 
This extra bit of flexibility will be useful as we encounter new examples in the remaining 
steps of the program. We expect the score to satisfy Conjecture [.2.2, which asserts 
that the score of a Delaunay star is at most 8 pt, for all the scoring systems satisfying 
Properties 1—4 below. The Kepler conjecture is true if Conjecture I.2.2 holds for any one 
such scoring system. We have found through experimentation that small or seemingly 
innocent changes in the score can lead to enormous changes in the complexity of the 
optimization problem. 


3.1. This paper proves the second step of the program for all of the scoring systems 
presented below. Write o (S) for the score of S. 

1. Suppose that the standard cluster R is a single quasi-regular tetrahedron: R = S. 
When the circumcenter of S is contained in S, 


—4bocr VOI(Vr M C(S)) + 


4 sol($) 
3 


is an analytic function of the lengths of the edges. This expression has an analytic 
continuation, denoted vor(S, Vr), to simplices S that do not necessarily contain their 
circumcenter. 

If rad(S) > 1.41, then define the score to be vor(S, Ve). Ifrad($) < 1.41, then define 
the score to be the compression I(S). (This rule agrees with the definition of vor(S) 
given in Section [.2.) 

2. Let S be a small simplex that is not a quasi-regular tetrahedron. The score of S 
will be either vor(S) or (5) depending on criteria to be determined by future research.! 
These criteria may depend on whether S belongs to a quasi-regular octahedron, but not 
on the position of any vertices of the packing outside S. It is essential for the scoring 
at all four vertices to have the same type (Voronoi or compression). The only constraint 
imposed by the second step of the Kepler conjecture will be o (S$) < 0, if S is small. This 
leads to the following mild restrictions on the use of Voronoi scoring. 

If one of the first three edges is the long edges (say the first), compression scoring is 
to be used if the second, third, and fourth edges have length at most 2.06, and the fifth 
and sixth edges have length at most 2.08. 


If one of the last three edges (say the fourth) is the long edge, compression scoring is 
to be used if (a), (b), (c), and (d) hold. 


(a) The first edge has length at most 2.06. 


' More generally, we might add a small constant c to the score of S at one of its vertices and subtract the 
same constant from another vertex. 
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(b) The second and third edges have length at most 2.08. 

(c) The fifth and sixth edges have length at most 2.2. 

(d) The fourth edge has length at most 2.58, or the fifth and sixth edges have lengths 
at most 2.12. 


3. Suppose that R is any standard cluster other than a quasi-regular tetrahedron. 
The cluster is a union of Delaunay simplices S),..., S,. Index the simplices so that 
Si, ..., Sp, for some p < r are the small simplices in the cluster. We define the score of 
the cluster R to be 


» a(S;) + > vor(S;, Vr), 


l<i<p p<isr 


where vor(S, Vr) = 4(—do¢ Vol(Vr MN C(S)) + sol(S)/3). 

4. If D* is a Delaunay star, then its total score o(D*) is a sum of the scores of the 
standard clusters of D*. 

Consider the quasi-regular tetrahedron S$ of Section 2 with vertices vo, v1, v2, 
and w that has a protruding tip Xo U X; U X2. Let sol, (S) denote the solid angle of S$ 
at the vertex v. The analytic continuation vor(S, Vr) has the following geometric 
interpretation. 


4sol,,(S) 


vor(S, Va(v)) = —48ocr(vol(S, Va(v)) + A(v)) + 3 


with the correction term A(v;) = — vol(X;), fori = 0, 1, 2, and 
3 
A(w) = )° vol(X;). 
i=l 


The only pieces that are compression scored are small simplices, everything else is 
Voronoi scored. The small simplices that are compression scored will be called simplices 
of compression type. The Voronoi-scored small simplices will be called simplices of 
Voronoi type. We define the restricted cell of a cluster R to be the complement in Vr of 
the small simplices in the packing. 


Lemma 3.2. 


(1) The score of a cluster depends only on the cluster, and not on the way it sits in a 
Delaunay star or in the Delaunay decomposition of space. 

(2) Let A denote the vertices of a saturated packing. Let Ay denote the vertices 
inside the ball of radius N. (Fix any center for the ball.) Let D*(v) denote the 
Delaunay star at v € A. Then the score satisfies (in Landau’s notation) 


die D*w)) = DITD*@)) + O(N”). 
An 


An 


Proof. Statement (1) holds by construction. 
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(2) The score reapportions the compression of a given star among surrounding stars. 
The second part of the lemma follows from the claim that everything is accounted for, 
if we ignore the boundary effects caused by the truncation N. Space is partitioned into 
regions each counted —46,.; times by the compression of some star. Each point in a 
sphere of the packing is counted four times by the compression of some star. To verify 
Lemma 3.2(2), we must check that the same holds of the score. 

We switch from Voronoi to compression scoring on certain small simplices. The 
faces F of a small simplex S satisfy rad(F) < 2, so no point on a face F of S can 
be closer to another vertex in the packing than it is to the closest vertex of F’. This has 
two implications. First, the only polyhedra Vz meeting a small simplex S are the four 
based at the vertices of S. Second, let R be a standard cluster. Let S$ be a small simplex 
in R. Then Ve 1 C(S) = Vr S. (In other words, tips cannot protrude from a small 
simplex.) This means that the restricted cells and small simplices cover space evenly. 
This decomposition is compatible with the standard decomposition of a Delaunay star. 

Consider the rules defining the score. In counting the part of the volume of a sphere 
contained in a simplex S, we see that it appears four times with weight | for a total 
weight of 4, when S is a small simplex of compression type. It appears once with weight 
4 for a total weight of 4, when S is of Voronoi type. 

The result is now clear. 


Remark 3.3. It is useful to summarize the proof from a slightly different point of view. 
If S is a quasi-regular tetrahedron or a small Delaunay simplex, then the sum of its four 
scores, for each of its four vertices, is 41°(S). This follows directly from the definitions 
(and the proof of Lemma 3.2) if the circumcenter of S is contained in S (which is always 
the case for small simplices), and it follows by analytic continuation in general. Any 
other point in space belongs to a unique Voronoi cell centered at some vertex v. If the 
point is not in a tip protruding from a quasi-regular tetrahedron, it is counted in the score 
at v. If, however, the point belongs to a protruding tip, it is counted in the score at exactly 
one of the three vertices, other than v, of the quasi-regular tetrahedron. In this way, every 
point in R? is accounted for. 


Remark 3.4. The choice of the parameter xp = 1.41 in Section 3.1(1) is somewhat 
arbitrary. The choice is based on the comparison of the functions 


filx) = vor(S(2, 2, 2, 2.51, 2.51,x), Vs) and fo(x) =P (S(2, 2, 2, 2.51, 2.51, x)). 


The difference f\(x) — fo(x) has a zero for some x € [2.2603, 2.2604]. This gives 
a crude estimate of when it is advantageous to switch from I'(S) to vor(S, Vs). The 
constant 1.41 is a little more than rad($(2, 2, 2, 2.51, 2.51, 2.604)) + 1.405656. 


Proposition 3.5. The Delaunay stars in the face-centered cubic and hexagonal-close 
packings score 8 pt. 


Proof. The eight regular tetrahedra each score | pt, and each regular octahedron scores 
0 pt, because it has density 6,.;, for a total of 8 pt. 
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We will see in Proposition 4.6 that the regular octahedron can be broken into smaller 
pieces that score 0 pt. 


4. The Main Theorem 


Theorem 4.1. 


(a) The score of any small quasi-regular tetrahedron is at most | pt. 
(b) The score of any other standard cluster is at most 0 pt. 


Proof. Statement (a) is a special case of Calculation I.9.1. A quasi-regular tetrahedron 
of Voronoi type scores less than 0 pt by Lemma 1.9.17. In the remainder of the proof, we 
actually prove a much stronger statement. We explicitly decompose each cluster (other 
than a quasi-regular tetrahedron) into a number of pieces and show that the density 
of each piece is at most 6,-. Since vor(S, Vr) and I'(S) are zero precisely when the 
corresponding densities are 6,,; (or when the volumes are zero), the theorem will follow. 
The relevant pieces will be congruent to one of the following types: 


1. A small simplex that is not a quasi-regular tetrahedron. 

2. Aset {tx :O0<t<1lxe Pic IR?, where P» is a measurable set and every 
point of P2 has distance at least 1.18 from the origin (Diagram 4.2(a)). 

3. Aset{tx:0<t<1,x € P3} Cc R°, where P; is a wedge of a disk of the form 


: 2.12 
P3 = {(X1, X2, 3) 1X3 = Zo, Xp + xz < 2,05 x) < xj}, 


for some a > 0 and some | < Zo < 1.18 (Diagram 4.2(b)). 
4. A Rogers simplex R(a, b, /2) where 1 < a < 1.18 and $ < b* < 2 (see 
Section I.8.6 and Diagram 4.2(c)). 


Diagram 4.2 


In the first type, a unit ball is placed at each vertex of the simplex S, and the density 
is the ratio of the volume of the part of the balls in S to the volume of S. In the second, 
third, and fourth types, a unit ball is placed at the origin, and the density is the ratio of 
the volume of the part of the ball in the region to the volume of the region. 

We decompose all of RR? into these four types and quasi-regular tetrahedra. Set all the 
quasi-regular tetrahedra aside. Classify all the small simplices, including those contained 
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in a quasi-regular octahedron, as regions of the first type. There remain the restricted 
cells. Now fix a Delaunay star D*, with center at the origin, and consider the restricted 
cell of one of the clusters in the star. We may assume that the restricted cell does not lie in 
a quasi-regular tetrahedron. Break the restricted cell up further by taking its intersection 
with the cones over each of its Voronoi faces F’. Let X be one such intersection. If the face 
F has distance more than 1.18 from the origin, classify X as a region of the second type. 
Now assume the face F' has distance f at most 1.18 from the center. Because h < «/2, 
the point in the plane of F closest to the origin lies on the face F. The set of points P, 
on the face F at distance greater than ./2 from the origin gives rise to a region of the 
second type. To study what remains, we may truncate F by intersecting it with a ball of 
radius /2. Let F’ C F be the truncated face. 

By Voronoi—Delaunay duality, the face F’ lies in the bisecting plane between 0 and 
some vertex v of the Delaunay star. Consider the collection of triangles formed by 0, 
v, and another vertex of the Delaunay star D*, with the property that either the triangle 
has circumradius at most 2 or all three edges of the triangle have lengths between 2 
and 2.51. Consider the half-planes (bounded by the line through 0 and v) containing the 
various triangles in this collection. This fan of half-planes partitions the face F’ into a 
collection of wedge-shaped pieces. Consider one of them F”. We claim that is has the 
form of Diagram 4.3. 


Diagram 4.3 


More precisely, F” is bounded by two triangular faces F, and F» (in this collection 
of triangles), two edges e; and e> of the Voronoi cell dual to the triangles, and an arc a 
obtained from the truncation. The two edges e; and e2 are perpendicular to the faces F; 
and F, respectively, by the definition of Voronoi—Delaunay duality. The edges e; and e2 
meet the faces F; and F2, respectively, by the construction of restricted cells. The edges 
e; and e> cannot intersect at any point less than /2 from the origin, because the point of 
intersection would be a point equidistant from the four vertices of a simplex formed by 
the vertices of F; and Fy. The simplex would have circumradius less than /72. Its faces 
would then also have circumradius less than 2, so that the Delaunay simplex is small. 
This is impossible, since all small simplices have already been classified as regions of 
the first type. 


445 


146 T. C. Hales 


Lemma 4.4. In this context, assume that the faces F, and Fz form an acute angle, and 
let p be the point at which the line through e, meets the plane through Fy. Let w, and 
wW» be the third vertices of the faces F, and Fy, respectively (that is, those other than 0 
and v). If the distance from 0 to p is at most V2, then the simplex (0, v, w1, w2) is small 
or a quasi-regular tetrahedron. 


Proof. Suppose that the distance from p to 0 is at most /2. Let c; be the circumcenter 
of F,. Let S be the simplex (0, v, w,, w2). 

We claim that p lies in the interior of the triangle F). The bisecting line ¢ between 0 
and v in the plane of F> contains p. The line € intersects two edges of Fr, once at v/2 
and once at some other point p’. If p is (strictly) outside F>, then |p| > |p’|. This leads 
to a contradiction, once we show |p’| > V2. If p’ lies on the edge between v and wy, 
this is clear, because an elementary exercise shows that every point on the line passing 
through w and v has distance at least /2 from the origin. Assume p’ lies between 0 
and w2, and consider p’ as a function of w2 and v. Its length | p’| attains its minimum 
when F> is the right triangle |v| = 2, |v — w2| = 2, |w2| = 2V2. Thus, |p’| > V2. 

So p lies in the interior of F>. The vertex w; has distance at most /2 from p. No 
vertex w, can come within \/2 of an interior point of F, unless the circumradius of F is 
at least /2. If the circumradius of F, is J2, then p is the circumcenter of S, so that the 
circumradius of S is /2, making S a small simplex. If the circumradius is greater than 
./2, then since | p — w;| < V2, p lies in the Voronoi cell at w;. Thus, w; clips (possibly 
degenerately) a standard region across the faces Fy from w1, based at 0, v, or w2. By 
Lemma 2.2 (w1, 0, v, w2) is a quasi-regular tetrahedron. 


We continue with our description of the figure in Diagram 4.3. The arc @ cannot be 
interrupted by a further (Voronoi) edge of F’. Such an edge would be dual to a (Delaunay) 
face with vertices 0, v, and some v’. The circumradius of the triangle with these three 
vertices would be less than /2 (because every edge in F’ comes within distance /2 of 
the origin). This contradicts the construction of F” with half-planes given above. This 
completes our discussion of the figure in Diagram 4.3. We emphasize, however, that the 
edges e; or é2 may degenerate to length 0, and the circular arc @ may degenerate to a 
point. 

This Voronoi face-wedge can be broken into three convex pieces: the convex hull of 
0, v/2, and the circular arc, and the convex hulls of 0, v/2, and the edge e;, fori = 1, 2. 
The first piece has the third type, the others have the fourth type. The boundary condition 
: < b? expresses the fact that the circumradius of a triangle with sides of length at least 
2 cannot be less than 2./2/3. This completes the reduction to the four given types. 

Now we must show that each of the given types has density at most doer. 


Type 1. A small simplex that is not a quasi-regular tetrahedron. Let S be a small 
simplex of Voronoi type with at least one edge longer than 2.51. By the monotonic- 
ity properties of the circumradius, we know that the circumradius of S is at least 
rad($(2,2,2,2.51,2,2)) > 1.3045. Let d(a, b,c) denote the density of the Rogers 
simplex (R(a, b, c) (see Lemma 1.8.6). By Roger’s lemma (1.8.6(2)), the six Rogers 
simplices Vs have density less than 5,-; and vor(S, Vs) < 0 if the circumradius of the 
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three faces is at least 1.207(5(1, 1.207, 1.3045) < 6,.;). This condition on the circum- 
radius of the faces holds whenever there are two edges longer than 2.51 at the origin 
(n(2.51, 2,2) > 1.207) or whenever there are two oppositely arranged edges longer 
than 2.51. 


Thus, to show that vor(S) < 0 for small simplices of Voronoi type, we must consider 
the following cases: (1) one edge longer than 2.51, (2) two adjacent edges longer than 
2.51, and (3) three edges longer than 2.51 meeting at a vertex. These cases are covered 
by Calculation 4.5.2. In (1), we may assume that at least one of the conditions for 
compression scoring in Section 3.1 fails to hold. In Calculation 4.5.2(2), we may make 
the stronger assumption rad(S$) < 1.39 for otherwise, the Rogers simplices at the origin 
have density at most 6(1, 7(2, 2, 2.06), 1.39) < doc: so that vor($) < 0. 

We rely on Calculation 4.5.1 for small simplices of compression type. The appendix 
proves the result for simplices in an explicit neighborhood of $(2V/2, 2, 2, 2, 2, 2). These 
calculations are established by methods of interval arithmetic described in Part I. Source 
code appears in [6]. 


Calculation 4.5.1. If S is a small simplex that is not a quasi-regular tetrahedron, then 
I'(S) < 0. If equality is attained, then the simplex S is congruent to § (2/2, 2,2, 2:2, 2) 
or to the simplex of zero volume (S(2/2, 9.2 97.9. 2). 


Calculation 4.5.2. Assume S is small. vor($(y1,...,¥6)) < Oif y1,..., ¥6) belongs 
to any of the cells (1)-(11). Let J denote the interval [2, 2.51] and L = [2.51, 2/2]. 


(1) L[2.06, 2.51]/4. 
(2) LI?[2.06, 2.51]/?. 
(3) LI7[2.08, 2.51]/. 
(4) [2.06, 2.51]/2LI’. 
(5) 1[2.08, 2.511 L727. 
(6) PL[2.2, 2.5111. 
CO) F258, 24/2112. 12, 25117, 
(8) LPLI. 

(9) LIPL*. 

(10) 277. 

(11) PL. 


Type 2. The set {tx : 0 < t < 1,x € Py}. In this case the density is increased 
by intersecting the set with a ball of radius 1.18 centered at the origin. The resulting 
intersection has density 1/ 1.187 < Soe, as required. 


Type 3. Theset {tx :0 <t < 1, x € P3}. The bounding circular arc of P3 has distance 
./2 from the origin. The set has the same density as a right circular cone, with base a 
disk of radius /2 — h? and height h. This cone has volume (2 — h?)h/3. The solid 
angle at the apex of the cone is 27 (1 — cos 9), where cos 9 = h/./2. This gives a density 
of /2 / (h2+ hvV/2). This function is maximized over the interval [1, 1.18] ath = 1. The 
density is then at most 2 — J/2 < Oncts 
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Type 4. A Rogers simplex R; = R(a, b, /2), where | < a < 1.18 and ; <b? <2. 


By Lemma I.8.6(2), the density of this simplex is at most that of the Rogers simplex 
Ro = RC, 2/3 13% sf). This simplex has the density 6,.; of a regular octahedron. (In 
fact, the regular octahedron may be partitioned into simplices congruent to R2 and its 
mirror.) We see that the original simplex R, has density 6,,; exactly when, in the notation 
of Lemma I.8.6(2), |s;| = |S2|, for all 41, A2, and A3 as above. This implies that a = 1 
and b = 2,/3/3. This completes the proof of Theorem 4.1. 


Proposition 4.6. A cluster other than a quasi-regular tetrahedron attains a score of 0 
pt if and only if it is made up of simplices congruent to S(2, 2, 2, 2, 2, 2/2), and possibly 
some additional simplices of zero volume. 


Proof. Types 2 and 3 always give strictly negative scores for regions of positive volume. 
According to Calculation 4.5, a region of the first type with positive volume gives a strictly 
negative score unless it is congruent to S(2, 2, 2, 2, 2, 24/2). 

Consider a region of the fourth type with score 0 pt. We must have a = 1| and 
b = 2,/3/3. The circumradius of the faces F; and F) of Diagram 4.3 is then 2\/3/3. 
This forces the faces F; and F> to be equilateral triangles of edge length 2. The arc @ in 
Diagram 4.3 must reduce to a point. The edges e; and eg in Diagram 4.3—if they have 
positive length—must then meet at a point at distance 2 from the origin. This point is a 
vertex of a Voronoi cell and the circumcenter of a Delaunay simplex S (of circumradius 
/2). The only simplex with two equilateral faces of side 2 and rad(S) = V2 is the 
wedge of an octahedron S = S(2, 2, 2, 2, 2, 24/2). This is a small simplex. 

The other possibility is that both the arc w and an edge (say e2) degenerate to length 
0. In this case, Lemma 4.4 shows that the restricted cell belongs to a small Delaunay 
simplex or a quasi-regular tetrahedron. These cases have already been treated. 


Appendix 


We give a direct argument that ['(S') < 0 pt, when the lengths of a small simplex S 
are within 0.001 of So = S(2V2, 2, 2, 2, 2, 2). Set S = S(y1, yo, y3, Ya, Ys, Yo). Write 
y= 2/2 — fi, and y; = 2+ f; fori > 1, where 0 < f; < 0.001. Set x; = yz = 8-e, 
and x; = y? = 4+¢e;, fori > 1. Then 0 < e; < 0.006. Recall from Section I.8.4 that 


A(Y1, Yas «++ Yo) = Yiyayst gyi Og +y3— Yat 5 y2(Vi +3 — YS) +5301 +99 — YE) 
Set 


Ay = a(Y1, V2, Y3, V4, Y55 Yo)s ayy = a(2/2, 2, 2, 2, 2,2) = 16 + 12/2, 
a1 = a(y1, Y5s Y6s Yas Y2s Y3)s ayo = a(2V2, 2, 2, 2, 2,2) = 16+ 12V2, 
ay = a(y4, 2, Yo, Yi V5, 3), 20 = a2, 2, 2, 2/2, 2, 2) = 16, 
a3 = a(y4, Ys, 3, Yi, 2 Yo), ~~ 30 = (2, 2, 2, 2/2, 2, 2) = 16. 


Section I.8.4 and the bounds on f; give a; > a; , where dy) = a; = 32.27 anda, = 
a, = 15.3. 
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Let A be the function of Section I.8.1, and set Ag = A(8, 4, 4, 4,4, 4). Set ¢ = 
VS A(x1,..-,X6)/2, and to = Ao /2. A simple calculus exercise shows that 


A(x1,.-.,X6) = A(8, 4, 4, x4, 4, 4) = 128 — 8e3. 


This gives f > 5.628. Let bj = 2/(3(1 + 12/a2,)), so that by = by = (3 + 2V2)/9 and 
by = b3 = 16/27. Set cy = —8per/6 + 23 bj /ajo © —0.00679271. Then 


(A — Ao)co —2eico 2 
< < 0.002e2 < 0.0002 fy. 
A(t+%) — +t) — . fs 


(t — to)co = 
We are ready to estimate (5). An argument parallel to that of Lemma I.9.1(1) gives 


cy eee Be pl qin ae qaye 
iMe2rwyee-nas ay wy 


i=0 aig i=0 G94; 


The two sums on the right-hand side are polynomials in f; with no constant terms. To 
give an upper bound on these polynomials, write them as a sum of monomials, and 
discard the negative monomials of order greater than 2. The positive monomials of order 
greater than 2 are dominated by 


: i als - < (O001E Te Fi és h cere 6 fs). 


This approximation shows that the first sum in (1) is at most —0.005 f; — 0.04 fy — 
0.03( fo + f3+ fs+ fe) and the second sum in (1) is at most 0.00056( fi + fo+---+ fe). 
The result easily follows. 

This argument is easily adapted to a neighborhood of S$; = (2/2, Dy. 2/2, 2,2). 
In this case, fori = 1,...,4, we have aj) = 16+ 8/2, b, = x, a; = 27, to = 0, 
co © —0.0225, and t > 0. A similar arguments leads to the conclusion that '(S) < 
T'(S,) = 0 pt, if S is a small simplex such that S$ # Sj, and the lengths of the edges of S 


are within 0.01 of those of Sj. 
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a(-), 281 

anc = anc(y;, y2, ye), 173 
A = A(h), 132 

A = A(v), 192 

Ag = Ao(d), 122 

A; = Aj(S,c, v), 126 
A(t, y), 260 

Ar(t, y), 261 

a(n), 233, 281 


be = 0.253095, 242 
b(p, q), 281 

bg = 0.49246, 242 
B(x,r), 87, 124 


c-vor, 131, 173 
CALC-123456789, 141 
c(n), 281 

C(V), 146 

C'(v), 146 

C(h, t), 131 

c-vor(-), 131 

cap;, 124 

crown(h), 173 


d(n), 281 
DS, 89, 118 
DS°, 117 
DS(G), 309 
D(v, A), 118 
dih(S), 163, 267 
dih(T), 246 


E(S, S'), 98 


J.C. Lagarias (ed.), The Kepler Conjecture: The Hales-Ferguson Proof, 


E(Y1, 2, --++ ¥6)s 98 


f (flute), 257 
Gfccs 139 

Ghep; 139 

G(D), 139 

gma(S) (= I'(S)), 243, 267 


m = 0.3621, 242, 267 


octavor(-), 244 


pt= in — 12 arccos ( 


90, 383 


) ~ 0.0553736, 


ol- 


q (quoin), 256 
Qo: 145 

Qy, 111 
quo(R), 123 


rad(S), 152, 159 
Ry, 154 
R = R(a, b,c), 123, 131 


sol(-), 124 

sol(R), 288 

sol($), 238, 261 

sol(T), 246 

s-vor, 125, 132 

s-vorg, 125 

s(v, C), 279 

S01, y2,---5 ¥6), 181, 185 


tgt = 14.8, 280 
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to = 1.255, 90 
tri(v), 280 


U(D), 90, 120, 138 
U(v, A), 90 


v, 257 

V(t), 259 

V(t), 259 
vor(-), 244 

vor p(-, +), 132 
vor Rr, 130, 131 
vorg, 77 

voro, p(D), 196 
vors (X), 203 
vor(T, v), 45, 54 
VC(V), 108, 149 
VC(D), 149 


W, (wedge) 152 
W®, 153 

w = w(F), 281 

w, w’ (wedges), 256 


Z(n, k), 198 


Tr, 192 
T(S), 125, 383, 385 


Sloop (+ *)s 217, 224 
Soct © 0.720903, 35, 74, 90, 139 
Stet, 74, 90 

8(v, W°), 153, 173 

8(x,1r, A), 87 

A, 185 

A(v, W°), 153 

A~(v, W°), 154 


€y(A, x), 154 
é/,(A, x), 154 


= 1 
2 arctan( 2 ) 


n = n(x, y, z), 100 
nt (Q), 126 
no(h), 152 


= 14.7947, 280, 288 


Kn = Vol(S;,), 32 
«(S), 181 


A, 202 
A, A(x,r), 87 


WT, v), 46 


v(Q), 181 


X = x1, %2,...,%6), 110 
xo0(S), 203 


E, = 0.01561, 188 
E/ = 0.00935, 188 

& = —0.029, 188 

Ex, = —0.01339, 188 
éy = 0.003521, 188, 192 
E(y, 0), 425 


7 = 0, 008, 189 
1p, 320 


o(S), 441 

o =o0(S,c, v), 126 
o(D), 90, 129 

o(T), 246 
oyR(F), 49 

oR = oR(D), 130, 165 


t(D), 165, 288 

tr (D), 233 

TR(D), 165, 288 
TLP(p, q), 169, 290 


Q(D) (Voronoi cell), 89, 120 
Q(V) (Voronoi cell), 88, 107, 119 


0.01561, 192 
0.0553736, 383 
1.453, 152 
2.51, 75 
14.7947, 280 
14.8, 280 
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adjacent, 279 cone, 131, 140 

pair, 101 conflicting diagonal, 102 
admissible (weight assignment), 281 context, 317 
aggregation, 293 of a quarter, 122 
analytic continuation, 212 contravening, 91 
anc, 173 decomposition star, 91, 229 
anchor, 96, 317 contravening plane graph, 277, 294 
anchored simplex, 317 convex, 200, 208 
arc sequence, 208 spherical region, 202 
archival list of graphs, 91 corer, 101, 293 
axis (of pivot), 98 concave, 200 

convex, 200 
ball, 87, 124 visible, 200 
barrier, 107 corrected volume, 88 
base point, 101 cross, 97 
boundary edges, 202 crowded, 318 
3-crowded, 4-crowded, 184, 318 

c-vor, 131 crown, 138 
CALC- 123456789, 141 cycle, 279 
cap, 124 length, 279 
central (vertex), 193, 292 
circle packing, 78 decomposition star (DS, D(v, A)), 89, 118 
circuit, 280 contravening, 91 
circumradius, 100 decoupling lemma, 114 
cluster degree (of a vertex), 280 

quad, 130, 139 density (Q(D)), 120 

standard, 130 Delaunay tesselation (triangulation, 
colored decomposition) 12, 36, 382 

points, 121 diagonal (of quarter), 95 

space, 121 dih (dihedral angle), 97 
compression, 125 dihedral cutoff, 246 

type, 127 dimensional reduction, 244 
computers, 79 
concave, 200, 208 edge 
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distinguished, 196 

nonexternal, 196 

of a plane graph, 279 

parameters, 198 
enclosed, 101 

vertex, 300 
erase, 180, 317, 319 
exceptional, 280 

cluster, 180 

face, 280 

standard region, 180 
extremal (quarter), 141 


face, 279 

-centered cubic, 68, 91 
fec-compatible, 88 
Ferguson, 92, 237 
first edge (of simplex), 97 
flat quarter, 317 
flute, 256, 258 
formal proof, 


gaps, 317 
geometric considerations, 97 


height (of vertex), 101 


hexagonal-close packing (HCP), 10, 68, 91 


hybrid decomposition, 76 


interpretation, 309 
interval arithmetic, 264 
isolated, 102 

pair, 102 


Kepler conjecture, 5, 87, 90 
kissing number, 8, 70 


labels 

edge, 97 
law of cosines, 144 
length (of a cycle), 279 
linear programming, 308, 359 
local optimality, 138, 151 
loop, 317 
LP (linear program), 308, 359 
LP derived inequality, 309 
mask, 320 
masked (quarter), 187 


n-cycle, 279 


negligible, 88, 128 
nonlinear inequalities, 352 
NLP (nonlinear program), 308 


obstructed, 107, 146 
octahedron, 96, 247, 248 
orientation (of simplex), 110 
oriented plane graph, 279 
orthosimplex, 122 

overlap, 96 


packing, 87 

saturated, 31, 87, 95 
pair 

isolated, 102 
passes through, 100 
patch, 284 
penalty, 319 
pentahedral cap, 238, 240 

PC, 241 

strict, 241 

prism, 76, 92, 94, 237-239 
pivot (of vertex), 98 
planar graph, 279 
plane graph, 279 
point (pt), 90, 129, 141 
projection (of a set), 97 
proper isomorphism, 280 


Q-system, 94, 96, 108 
quad cluster, 130, 170, 171 
mixed, 263 
obtuse, 263 
squashed, 253, 257 
truncated, 252 
quadrilateral, 280 
quarter, 40, 141, 316 
flat, 95, 127, 192, 247, 317 
masked, 187 
strict, 95, 126 
upright, 95, 127, 317 
quartered octahedron, 96 
quasi-regular 
tetrahedron, 40, 77, 95, 141 
triangle, 95, 107, 126 
quoin, 123, 256, 258, 259 


r-vor, 131 
radial projection, 97 
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relaxation (of NLP), 308 translation-invariant, 97 
right circular cone, 131 truncated 
Rogers simplex, 123, 131, 139, 255, 256 corner cells (tcc), 180, 203, 213 
Voronoi 
satisfaction, 309 function, 76 
saturated (packing), 31, 87, 95 scoring, 252 
score, 76, 89, 122, 129, 131, 240 truncation parameter (f9 = 1.255), 90, 95 
scoring function, 89 type (of a vertex), 280, 290 
separated set, 281, 301 
simplex unconfined, 318 
special, 196 3-unconfined, 184 
solid angle, 124 upright, 317 
standard region, 113 
strict V-cell (VC), 108, 149 
diagonal, 95 vertex, 87, 95, 120, 279 
PCP pentahedral prism, 241 distinguished, 97, 139 
subdivision, 265 enclosed, 101 
subregion, 320 vor, 130, 131 
successor, 279 Voronoi 
cell, 88, 89, 107, 119, 120 
tame (graph), 91, 279, 282, 356 scoring, 245 
target (tgt = 14.8), 280 type, 127 
total weight, 282 
triangle, 143, 219, 280 wedge, 152, 255 
triangular standard region, 113 weight assignment, 280 
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p.i Image courtesy History of Science Collections, University of Oklahoma Li- 
braries. 


p. 4 The Six-Cornered Snowflake translated by Hardie (1966) Title page. By per- 
mission of Oxford University Press. 


p. 6 Image courtesy of the Division of Rare and Manuscript Collections, Cornell 
University Libraries. 


p. 7 Image courtesy of the Division of Rare and Manuscript Collections, Cornell 
University Libraries. 
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